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Einstein's general relativity (GR) is a dynamical theory of the space-time metric. We 
describe an approach in which GR becomes an SU(2) gauge theory. We start at the 
linearized level and show how a gauge-theoretic Lagrangian for non-interacting massless 
spin two particles (gravitons) takes a much more simple and compact form than in the 
standard metric description. Moreover, in contrast to the GR situation, the gauge theory 
Lagrangian is convex. We then proceed with a formulation of the full nonlinear theory. 
The equivalence to the metric-based GR holds only at the level of solutions of the 
held equations, that is, on-shell. The gauge-theoretic approach also makes it clear that 
GR is not the only interacting theory of massless spin two particles, in spite of the 
GR uniqueness theorems available in the metric description. Thus, there is an inhnite- 
parameter class of gravity theories all describing just two propagating polarizations of 
the graviton. We describe how matter can be coupled to gravity in this formulation 
and, in particular, how both the gravity and Yang-Mills arise as sectors of a general 
diffeomorphism-invariant gauge theory. We hnish by outlining a possible scenario of the 
ultraviolet completion of quantum gravity within this approach. 
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Since the mathematicians have invaded the theory of relativity, I do not understand it myself 
anymore. 

Albert Einstein 



1. Introduction 

This expository paper is about an approach to the theory of general relativity 
(GR) that departs rather far from how its initiator Albert Einstein thought 
about the subject. The present author is a physicist by training but working 
in a mathematics department, and so it is quite possible that Einstein would be 
able to say about this work what he said on a different occasion and what is 
quoted above. The approach we shall describe makes a conceptual jump when 
compared with Einstein's views on the nature of the gravitational force. Thus, in 
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Einstein's GR, the gravitational field is encoded in the geometrical properties of 
the space-time, and the geometry itself is described by the space-time metric. The 
approach described here will still be geometrical in spirit. However, the geometry 
will be that of gauge fields on the space-time manifold, not that of a space-time 
metric. No metric will be present in the formulation of the theory, and it will only 
appear later as a derived concept. 

Accepting for the moment that such an approach is indeed possible, a natural 
question to ask is if it is worth considering any such reformulation of GR. Indeed, 
it is clear that it would require drastic changes to the century-old metric-based 
intuition about the subject. Such conceptual changes can only be justified if 
they can help to solve problems that are difficult to address in the traditional 
framework. As we now briefly review, there are indeed at least two major 
difficulties with GR. 

The first problem of GR is that of the unification with the other forces. Thus, 
Einstein's theory was developed already almost 100 years ago, in particular, before 
the theory of quantum mechanics was discovered. And during the last century, 
our understanding of the behaviour of the micro-world has culminated in the 
discovery that almost all interactions in Nature (apart from gravity) are described 
by (quantum) gauge fields. Thus, it is now believed that (to the energy scales of 
the conjectural grand unification) the electromagnetic, the weak and the strong 
interactions are all described by the Yang-Mills theory of the (spontaneously 
broken) Standard Model gauge group U(l) x SU(2) x SU(3). In other words, at 
least to a very high energy scale, the gauge fields appear to be fundamental. 
Only the gravitational force stands far aside from this compelling pattern. Using 
the technical jargon, we say that gauge fields describe particles of spin 1, while the 
gravitational interactions are carried by particles of spin 2, and this is where 
the difference comes from. At the same time, if it was possible to reformulate 
the theory of gravity in the language closer to that of gauge theories, we would 
possibly come closer to understanding how gravity fits together with the other 
forces of Nature. To put it differently, it is not clear that Einstein himself would 
be thinking in terms of the space-time metric if he was trying to invent his theory 
of gravity nowadays, with our present solid appreciation of the importance of the 
gauge principle. 

The other problem of GR is related to the quantum properties of the 
gravitational field. These do not seem to be of any importance at energy 
scales accessible to our observations or experiments. At the same time, they 
can be studied theoretically and reveal that gravity is again unlike all the 
other forces. For other forces, the quantum vacuum polarization effects lead to 
the strength of interactions becoming energy scale-dependent (the phenomenon 
of renormalization) . Yet, the form of the field equations (or the Lagrangian) 
describing these interactions does not change with the energy, only the coupling 
constants (and other physical parameters, e.g. masses) flow with the scale. One 
says that the theories that form the standard model of elementary particles are 
renormalizable. Gravity is not like this. Not only the parameters of its Lagrangian 
flow with the energy, but also the form of the Lagrangian itself must be modified 
once the quantum corrections are taken into account. One says that gravity is 
non-renormalizable. What this means is that we know for certain that Einstein's 
theory of GR cannot be a fundamental theory, for at very high energies, it is 
modified (by quantum corrections) in a way we cannot currently control. 
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A possible way out of the abovementioned difficulties is to imbed both gravity 
and the gauge-fields in a different theory that resolves these problems. This is, 
for example, what string theory attempts to do. However, the string theory, at 
least in the way we currently understand it, appears to be far from being an 
economical solution of the problem, as it requires supersymmetry as well as many 
extra unseen dimensions. It is not impossible that a more economical solution to 
the problems of gravity could exist. 

As we shall explain in this article, the present gauge theory approach may be 
of help with both the problem of unification with other interactions as well as the 
problem of the quantum behaviour. We emphasize, however, that neither of these 
two goals has been achieved as of yet , there are only hints (to be described) that 
this may be possible. A sceptically minded reader may then conclude that in the 
absence (as of yet) of any real applications, the approach presented here is not 
worth the trouble learning, for it requires developing a completely new intuition 
(gauge field-based) about the gravitational force. 

At the same time, we note that in the past, science has often progressed 
by first reformulating a known physical law in a non-trivially equivalent form, 
and then this non-trivial reformulation suggesting a generalization leading to a 
new physical theory. For this author, the most striking example of this historical 
phenomenon is the discovery of quantum mechanics. Indeed, at least in its first 
formulations, it was so essentially built on the Hamiltonian formulation of the 
classical physics. Thus, it appears to be profitable to reformulate known physical 
laws in a form that is non-trivially equivalent to the original formulation. This 
way we not only learn more about the theory at hand, but also potentially 
step on the path to new physics. We therefore proceed with our development 
of a gauge-theoretic description of gravity having at least this historic lesson as 
the motivation. 

As for a mathematically minded reader, we motivate our constructions by the 
following remarks. In spite of Yang-Mills theory and gravity being so different in 
spirit — the former is about connections in principal G-bundles while the latter is 
about metric-compatible connections in the tangent bundle — a subtle interplay 
between the two theories has become apparent via work on Yang-Mills instantons 
in the late-1970s. This is particularly clearly illustrated by the fact that the index 
1 SU(2) instantons (self-dual solutions of the Yang-Mills field equations) are all 
gauge-equivalent to the self-dual part of the gravitational Levi-Civita connection 
for metrics conformally related to the standard round metric on S (see Atiyah 
et al. (1978), theorem 9.1). Rephrasing, one can say that all index 1 SU(2) 
instantons come from gravitational instantons. Thus, at least in the self-dual 
case, there is an intimate relation between the solutions of the two theories. It is 
this relation that is going to be at the root of the developments reviewed in this 
paper, with the notion of the self-duality playing the central role. 

The final word of caution is as follows. We make no claim here that by 
simply reformulating gravity in the language of gauge fields, one comes closer 
to understanding the problems of quantum gravity and unification. In fact, such 
reformulations were around for a long time, with historically the first being those 
due to Plebanski (1977) and MacDowell & Mansouri (1977), then followed by 
a new Hamiltonian formulation owing to Ashtekar (1987). These formulations 
of GR do shed some new light on the problems of gravity, but do not by 
themselves provide a solution of these problems. So, the novelty of the approach 
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described in this paper is not in the fact that a gauge-theoretic reformulation of 
gravity is possible — this was known for quite some time. It is in the fact that 
the specific gauge theory description to be reviewed here has certain attractive 
features not shared by other formulations, and these may eventually help us 
to come to terms with the abovementioned problems of gravity. Thus, one of 
the most interesting features of the new formulation is that the (Euclidean 
signature) action functional becomes convex, a very desirable property that the 
usual metric-based formulation does not have. This feature is in turn related to 
other simplifications in the structure of the theory, which is what gives us hope 
that the new formulation can shed new light on the problems of gravity. 

With these introductory remarks being made, we shall proceed with our gauge- 
theoretic description of gravity. We shall follow the bottom-to-top approach, in 
which we will first present a gauge theory description of gravity in as least general 
terms as possible, and then proceed to exhibit the general principle. Thus, in §2, 
we show how the linearized theory (i.e. gravitons) can be described in the gauge 
theory terms. We present the construction of the full nonlinear theory in §3. This 
section also discusses how the unification is realized in the present approach. 
We then discuss the quantum theory in §4. We reiterate the main points of our 
approach in §5. 

2. Gravity as a gauge theory: linearized level 

We start by showing how gravity can be described in gauge-theoretic terms at the 
level of the linearized theory. This is already non-trivial, because the statement 
that gauge fields are spin 1, while gravitons are spin 2 particles is the statement 
about the corresponding linearized theories. 

From the outset, we note the fact that the linearized GR described in terms 
of connections is not new. For instance, this is possible in the framework of 
Plebanski formulation of GR (Plebanski 1977) and related to it (via the space 
plus time decomposition) Ashtekar Hamiltonian formulation (Ashtekar 1987). 
The principal difference between these and our approach is that our linearized 
theory Lagrangian will be a functional of only the connection field, with no other 
(auxiliary) fields necessary (as in, e.g. Plebanski 1977). We shall see that this 
leads to some important simplifications, and that our linearized theory will be 
arguably simpler than all other known descriptions of the linearized gravity. 
In particular, as we shall see below, the diffeomorphisms are realized in our 
gauge-theoretic description in a particularly simple way (different from all other 
existing descriptions), and the corresponding gauge components of the field can 
be projected away from the very beginning, leading to considerable simplifications 
in the structure of the theory. In addition, we shall see that our Lagrangian for the 
linearized GR has very much the same form as that for the linearized Yang-Mills 
theory, which will eventually be of help when we put the two theories together. 

The gauge-theoretic description of free massless spin 2 particles given in this 
section appears to be new, and was first given by Krasnov (2011a). 

(a) Linearized Yang-Mills theory 

Let us start our description by briefly reviewing the linearization of the Yang- 
Mills theory. Let A be a connection in the principal G-bundle over the space-time 
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M, and let F = dA + (1/2) [A, A] be its curvature two-form. Here, we assume G to 
be compact. We use the conventions in which F = (l/2)F ll „dx^ L A dx" and often 
suppress the Lie algebra index of F for brevity. If is a (given) space-time 
metric, which we for now assume to be that of the Minkowski space-time, then it 
can be used to raise the indices of F M „ to get F^", which can then be contracted 
with F M „ to form the familiar Yang-Mills Lagrangian 

C YM = --^Tr(F^F, v ), (2.1) 

where the trace stands for a (in our conventions positive-definite) Killing-Cartan 
form on the Lie algebra of G, appropriately normalized. Thus, the trace can 
also be replaced by a sum over the Lie algebra indices of the curvature squared. 
At the linearized level (expanding around the zero background connection and 
introducing the canonically normalized perturbation a so that A = yyu a) , we get 

4m = -|Tr(a^"(8 M a, - 9,a M )). (2.2) 

As usual, the need for the minus sign in front of the action is dictated by the desire 
to have a positive-definite Hamiltonian. Indeed, introducing the space+time 
split n — (0, i), i = 1, 2, 3 and taking (for defmiteness) the metric signature to be 
(-,+,+,+) we get 

= |Tr((a, - d, M ) 2 - (B^ 2 ), (2.3) 

where the dot denotes the time derivative and we have introduced a notation 
Bi = eijkdjdk. We therefore see that the momentum canonically conjugate to cij 
is Ei := hi — dido, and the momentum conjugate to the time component of the 
connection oq is zero. Thus, the time component of the connection plays the 
role of the Lagrange multiplier imposing the so-called Gauss constraint, and the 

(2) . (2) 

linearized Hamiltonian % Y M = ^(^i^i) ~ ^ym ^ s gi ven D Y : 

MSi = !Tr((£,;) 2 + (Bf) 2 ) - Tr( % diEi), (2.4) 

where we have integrated by parts in the last term. The Hamiltonian is explicitly 
positive-definite (if the gauge group G is compact), modulo a constraint term 
that imposes the electric field to be trace-free = 0 and generates the usual 
gauge transformations of the connection 5^^ = di<p, (p being an arbitrary Lie 
algebra- valued function. The Hamiltonian is easily seen to be gauge-invariant, 
as the electric field does not transform (at this linearized level) and the linearized 
connection transforms so that the magnetic field B l is gauge- invariant. 

(b) Self-duality 

In preparation to the developments in the case of gravity, we will add to the 
Lagrangian (2.1), a certain total derivative term. This will have no effect on 
the physics, but will allow us to write the Yang-Mills Lagrangian in a form later 
almost matched by the gravitational one. To write down the total derivative term 
required, we need to introduce some new object that will play a very important 
role later. These objects are certain self-dual two-forms and thus build on the 
notion of the self-duality on two-forms (in four space-time dimensions that we 
work in). 
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Recall that, given a metric g^ v (such as, e.g. the Minkowski metric we have 
been working with up to now) one has the volume form e^ p(T for g^. Two of its 
indices can be raised with the metric to obtain an object e M / CT that can be applied 
to a two-form U^ v with the result being again a two-form — the Hodge dual of U. 
We have: 

U; v :=\^rU pa . (2.5) 

As it is not hard to check, in our case of a metric of the Lorentzian signature, 
the square of the Hodge operator is minus one. Thus, its eigenvalues are ±i, and 
the space of two-forms splits into two orthogonal subspaces. These are referred 
to as the spaces of self-dual and anti-self-dual two-forms. We note in passing that 
these properties of the Hodge duality operator are quire reminiscent of those of a 
complex structure (on the space of two-forms) , with the spaces of self- and anti- 
self-dual forms being analogous to holomorphic and anti-holomorphic elements. 
This is more than analogy, and we refer the interested reader to Brans (1974) for 
a development of this idea. 

Thus, for any self-dual two-form we have: 

l^rUp^W^ (2.6) 

and for anti-self-dual form, we have an extra minus on the right hand-side (r.h.s.). 
The space of self-dual two-forms is three-dimensional, and we can introduce a 
basis in it. A choice of such basic self-dual two- forms can be rather arbitrary as 
long as they span the required subspace. However, there is always a canonical 
(modulo certain gauge rotations, see below) orthonormal choice of the basis. Let 
us denote the canonical self-dual two- forms by S^ v , i= 1,2,3. Note that we have 
denoted the index enumerating the two-forms by the same letter as was used 
to refer to the spatial index in the Hamiltonian analysis above. This is not an 
oversight; we shall now see that the two indices can be naturally identified. The 
canonical self-dual two-forms are required to be orthonormal in that 

e^S^ = 8i5\ (2.7) 

where the numerical coefficient on the r.h.s. is convention-dependent, and 5 V is 
the Kronecker-delta. It can be shown that the self-dual two-forms satisfying (2.7) 
are defined uniquely modulo SO (3) rotations preserving 8 V . We can now give 
an explicit form of the basic self-dual two-forms in the case of the Minkowski 
space-time metric. Using the two-form notation, we have: 

r = idt a dx l + dx j A dx k , (2.8) 

it is not hard to check that S l are self-dual (with the conventions that e 0123 = +1), 
and that (2.7) holds. Let us also note what becomes of the components of the 
two- forms S % under the space+time split. We have: 

^u.-i'l and 4 = eV (2.9) 

Thus, we see that the objects S 1 ^ indeed provide a natural identification of the 
basis index i with the spatial index. This fact will be used later in our discussion 
of the gravitational case. 

Proc. R. Soc. A (2012) 



Review. Gauge-theoretic approach to gravity 



2135 



For later purposes, let us note an important identity satisfied by our self-dual 
two-forms. We have 

Sl'SP/ = -<5*V + e^S k /. (2.10) 

Thus, the basic self-dual two- forms satisfy an algebra similar to that of Pauli 
matrices. This identity can be checked by a direct verification, using the explicit 
expression (2.8). 

For our discussion of the Yang-Mills theory, we need one more identity 
involving S l . Thus, we note that we can use S's to construct the projector on 
the space of self-dual two forms. This is an operator P + whose square coincides 
with itself, and which projects any two- form onto its self-dual part. We have: 

nr - \ (Vxi - ^/*) = \v pa - (2- n ) 

Finally, we note that the self-dual two-forms E % have appeared in the literature 
on many occasions before. In the literature on instantons (self-dual solutions 
of the Yang-Mills held equations), these objects are often referred to as 't 
Hooft's symbols ('t Hooft 1976). They will be of fundamental importance in the 
considerations that follow. 

(c) Linearized Yang-Mills revisited 

We now use the objects introduced in the previous subsection to rewrite the 
linearised Yang-Mills Lagrangian (plus a surface term) in a convenient form for 
later purposes. Thus, modulo a total derivative term we have: 

4m = -2P w Tr(9 Ma! ,a p a ff ). (2.12) 
Using the basic self-dual two- forms S l introduced above, we can rewrite this as 

£® l = -iTr(E i i»d ll a,) 2 . (2.13) 

An alternative convenient way to write the above Lagrangian is to introduce a 
basis in the Lie algebra, so that a^ = a"T a , with o being the Lie algebra index 
and T a being the generators normalized so that Tv(T a T b ) = d ab . With this in 
mind, we rewrite the Yang-Mills Lagrangian as 

4m = -\KMX ltlv dX)(Z ]pa d P a b a ). (2.14) 

Our linearized gravitational Lagrangian below will strongly resemble this form of 
the Yang-Mills Lagrangian. 

An instructive exercise is to repeat the Hamiltonian analysis of Yang-Mills but 
starting from (2.14). To this end, we need the space+time split of the combination 
E^d^a® that appears prominently in (2.14). We have: 

J^"9X = -i(o? - didS) + e^%al (2.15) 

We now see that in this formulation of the theory, the momentum conjugate to 

Oj is 

7Ti := a, - 8,^ + i£< = E { + iB h (2.16) 

where E% is the usual electric held and we have used the same definition of the 
magnetic field Bi as above. We have again suppressed the Lie algebra index for 
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brevity. The Hamiltonian in this formulation reads 

-H% = Tr(|(7r 8 ) 2 - mBi) - Triaodiin). (2.17) 

This can be rewritten as 

H^l = |Tr((7r 2 - iB,) 2 + (B,) 2 ) - TrM^), (2.18) 

which is the same Hamiltonian as we have obtained above in (2.4). Indeed, this is 
obvious for the first term, and for the last Gauss constraint term, the shift of the 
momentum 7T, by a multiple of Bi has no effect as the magnetic held is transverse. 
We see that in this formulation, the momentum conjugate to the connection is 
not real, but rather only the combination 7T, — \Bi = E± is real. At the same time, 
the spatial part at of the connection is real. 

The reason why the complexity crept in is that we have added to the 
Lagrangian a term that is a purely imaginary total derivative. This has no 
effect on the dynamics, but it does affect the structure of the Hamiltonian 
formulation of the theory, in particular, the symplectic structure. This is 
why the momentum variable is no longer the electric held, but rather the 
combination (2.16). This corresponds to a canonical transformation on the phase 
space of Yang-Mills theory with a purely imaginary generating function. The 
analysis of this subsection is of course very familiar from the discussions of the 
(9-angle ambiguity in the quantization of Yang-Mills theory (see Ashtekar (1991), 
Section 13.2) for a very readable account. The only difference with the more 
familiar case is that the parameter of the canonical transformation is taken to 
be imaginary. This is certainly a legitimate operation at the classical level, with 
the price to pay being that one has to be careful about the reality conditions 
satisfied by the fields. We shall see that similar care will have to be exercised in 
the gravitational case, where analogous reality conditions are to be imposed. 

(d) Linearized gravity 

We now turn to the main object of our interest — gravity. We start by simply 
writing down the linearized Lagrangian of a form similar to (2.14), and later 
explain how it comes about. 

The first step is a choice of the gauge group, and we take G = SO(3). We shall 
then denote the Lie algebra index by the same lower case Latin letters from the 
middle of the alphabet, which we have already used to refer to the spatial indices. 
Again this is not an oversight, as we shall later see that these types of indices 
can be naturally identified. We then write a Lagrangian essentially of the same 
form as (2.14), but with a different tensor used to contract the two quantities 
E^d^a®, where a is now to be replaced by an SO(3) index j. Thus, we write 

4r = -\P l:m {^ v ^{^Q p a l a ), (2.19) 

where 

P m ■- pikjfij _ itfjjp (2.20) 

is a projector on symmetric trace-free 3x3 matrices. As we shall soon see, the 
Lagrangian (2.19) describes a spin 2 particle, in contrast to (2.14) that describes 
a spin 1 particle for each Lie algebra generator. The fact that (2.19) gives a spin 2 
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particle is related to the fact that (2.20) is a projector on spin 2 representation 
in the tensor product of two spin 1 representations of SO(3). 

Let us now discuss the invariances of the Lagrangian (2.19). As in the 
case of Yang-Mills theory, (2.19) is obvious invariant under the SO(3) gauge- 
transformations ci^a^ = 9^0* . Indeed, the partial derivatives commute, and the 
objects S m " are anti-symmetric in their space-time indices. However, unlike in 
the Yang-Mills case, the Lagrangian (2.19) has an extra invariance, and this is 
how the diffeomorphisms appear in the game. Consider 

= (2.21) 

where S % are the basic self-dual two-forms (2.8), and P is an arbitrary vector 
held. Note that there are no derivatives present in this transformation rule, this 
is simply a shift of the linearized connection in a certain direction in the held 
space. It is not hard to see that our Lagrangian is invariant under such shifts. 
Indeed, we use the algebra (2.10) of the basic self-dual two-forms and note that 
the product of two U's is in either the spin 0 part (the trace) or in the spin 1 
part (the anti-symmetric part) in the tensor product of two spin 1 representations. 
However, the Lagrangian contains the spin 2 projector (2.20), and so is insensitive 
to the shifts (2.21). So, (2.21) is an invariance of the Lagrangian. However, as we 
shall soon see, this is not a gauge invariance in the sense how the usual gauge 
transformations of the connection were the gauge invariances of the theory in 
the case of Yang-Mills theory. Thus, we shall see that there are no Lagrange 
multipliers for the constraints generating (2.21) in the Hamiltonian formulation 
of the theory. The significance of this fact will be discussed later. 

Let us now repeat the exercise of the Hamiltonian analysis of the theory. We 
have already made most of the steps needed for this in the discussion of the Yang- 
Mills theory. So, we can simply write the expression for S^d^Ovj from (2.15). 
We have: 

S^dpOvj = -i{hij - bi<M)j) + ef'dkaij (2.22) 

where the second index on the spatial connection is the lowered Lie algebra one. 
In the Lagrangian, this expression is multiplied by the spin 2 projector (2.20). 
Thus, we immediately see that only the spin 2 part of the connection held is 
dynamical. The momentum conjugate to this part of the connection is 

= P* l {{h u - a.oo,) + ie k mn d m a nl ). (2.23) 

We see that this expression depends only on the time derivative of the symmetric 
trace- free part of a^, on the temporal component of the connection as well as 
on spatial derivatives of the full connection a i; j. Let us now discuss the dependence 
on the latter. For this, it is very convenient to split the spatial connection into 
its irreducible with respect to SO(3) components. Thus, we have: 

aij = af + af + af, (2.24) 

where is the spin 2 part that is symmetric trace-free, is the anti-symmetric 
spin 1 part and a- 0 ' is the spin 0 part proportional to 5y. We immediately see 
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that because of the symmetrization imposed by the projector P^ k \ there is no 
dependence in the last term in (2.23) on the spin 0 part a y °\ Next, let us write 

af = e ijk c k , (2.25) 

where c k captures the spin 1 part of the spatial connection. Then, a simple 
calculation shows that the momentum (2.23) can be written as: 

^> = P*(4f + ie k mn d m a% - d k (aoi + ic,)). (2.26) 
We thus see that the Lagrangian (2.19) depends only on the spin 2 part of the 
spatial connection ay, as well as on the temporal ao, t and spin 1 components 
in the combination aoi + (i/2) e^aj^ . There is no dependence at all on the spin 0 
part a y °\ 

(2) ■■ (2) (2) 

Let us now find the Hamiltonian Hqr = ~ £qr- We have: 

Koi = H^ 3 ) 2 ~ ™ i3 *i U dkai 3 - (aot + \r )i) n\ (2.27) 

Here, we have removed the superscript (2) from the spin 2 component of 
the connection for brevity, with the understanding now that this is the only 
component of the connection that is dynamical. We have also removed all the 
projectors P^ kl because now the momentum -n i} is symmetric traceless and so no 
additional projection is necessary. In addition, we have integrated by parts in the 
last term. 

The structure of (2.27) is already familiar from the discussion of the Yang- 
Mills theory above. We could write the Hamiltonian in a more suggestive form 
by introducing the 'magnetic held' 

B»:=P^ kl e k mn d m a nl (2.28) 

and writing 

«S = - i^ 7 ) 2 + (B 13 ) 2 ) - (go, + ir)(>7T • (2.29) 

This is exactly analogous to (2.18), with the only exception being that the 
momentum variable ir v as well as the magnetic held B v are now spin 2 fields. 
As in the Yang-Mills case, the last term, when varied with respect to the 
combination a 0i + ici, generates the constraint that requires the momentum to 
be transverse. The reality conditions that need to be imposed to guarantee the 
positive-definiteness of the Hamiltonian are also analogous to the Yang-Mills 
case. Indeed, we see that the combination 7T 8? — must be required to be real, 
as well as the spin 2 part a\- of the spatial connection (and thus the magnetic 
held 

Thus, we see that at the level of the Hamiltonian formulation (as well as at the 
level of the action) , the treatments of Yang-Mills theory and gravity are exactly 
analogous. The main difference comes from the fact that in the gravity case, the 
gauge group has to be taken a specific one SO(3), and that certain propagating 
modes of the Yang-Mills theory with its two modes per generator have been 
projected out to leave only two propagating degrees of freedom of gravity. Indeed, 
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the effect of the projector P^ kl inserted in the Lagrangian (2.19) is precisely to 
remove four of the six propagating modes of SO (3) Yang-Mills theory, leaving 
only two gravitational d.o.f. At the level of the Hamiltonian (2.29), this projection 
is already carried out because only the spin 2 part of the spatial connection (and 
its conjugate momentum) is present. 

The above discussion of the propagating mode content of (2.19) can be 
rephrased as follows. The propagating d.o.f. of Yang are described by the spatial 
connection a; (with the Lie algebra index suppressed), which can be set (using 
gauge transformations) to satisfy the transverse condition 9 8 a.; = 0. In the case 
of gravity, we have exactly the same starting point, and the spatial connection 
a?, where now the Lie algebra index is explicitly indicated. Since the group 
SO(3) of gauge rotations in this case can be identified with the group of spatial 
rotations (this identification is provided, e.g. by the spatial projection of the self- 
dual two-forms S l ), the two indices of a\ are on the same footing. Thus, the 
spatial connection in this case is in the product of two spin 1 representations: a e 
V 1 ® V 1 , where our notation is that is the space of irreducible representation 
of SO(3) of dimension dim( V^) = 2j + 1. Decomposing V 1 <8> V 1 into irreducibles, 
we get a direct sum V 2 © V 1 © V°. The projector in (2.19) projects out the spin 

(2) 

1 and spin 0 components, leaving only a single dynamical spin 2 field a]J'. Gauge 

transformations (SO (3) rotations) still act on it and can be used to set this field 
to be transverse. Once this is done, the term (B^) 2 becomes (QjO^) 2 , as can be 
checked by a simple calculation. Thus, once the gauge has been fixed, we get the 
following linearized Hamiltonian: 

4r = K(^) 2 + (9»^) 2 ), (2.30) 

where we have denoted the real momentum by p v = ix %] — iB tJ . This is the usual 
linearized Hamiltonian of the metric-based GR, as we shall review in §2e. 



(e) A comparison to the linearized Einstein-Hilbert action 

Above we have seen how a spin 2 particle can be described by a Lagrangian 
of the Yang-Mills type, but with an additional spin 2 projector inserted. At the 
level of the Hamiltonian formulation, we have seen the familiar from the metric- 
based treatment symmetric traceless transverse tensors appearing. The natural 
question that arises is how the above Lagrangian is related to the usual metric- 
based GR Lagrangian linearized around the Minkowski metric. The purpose of 
this subsection is to discuss this relationship. 

The Einstein-Hilbert Lagrangian (for our choice of the signature ( — ,+ ,+ ,+)) 
reads: 




(2.31) 



where k 2 = 32irG, with G being the Newton's constant, and R the Ricci scalar. 
We take the background metric to be the Minkowski one, and linearize (2.31) 
taking the perturbation to be Kh^, so that h^ v is the canonically normalized field. 
We get the following linearized Lagrangian: 

4h = -U d ° V) 2 + + (9" V) 2 + OTV, (2-32) 
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where as usual we have introduced the notation h := g^h^, with g^ v being the 
background (Minkowski) metric. 

We note that (2.32) is much more involved than the gauge-theory Lagrangians 
we have encountered above. We also note that the trace part h of the metric 
perturbation behaves like a held with a wrong sign in front of its kinetic term. 
This is of course not a source of problems in GR, for this held turns out not to 
be dynamical. However, this does create problems for the path integral approach 
to the gravitational quantum theory, where one has to integrate over all metric 
fluctuations, including those described by h. This mode makes the Euclidean 
signature path integral not convergent. We will come back to this point below. 

The above Lagrangian is invariant under the following transformations: 

<5 f /t M „ = 9( M f„), (2.33) 

which represent linearized diffeomorphisms. 

The Hamiltonian description of the linearized GR is (briefly) as follows. 
Choosing the time+space split and decomposing the metric perturbation 
into its hoo, hti-, hy components, we get (after quite some rearrangements and 
cancellations) the following involved structure: 

4h = |(%) 2 - kihkkf ~ |(9 2 ^) 2 + !(a 4 M 2 + (9 8 %) 2 + /' - 9 />. 
+ ho 0 ^h kk - d'&hy) + 2//„ <i // . 

where we have introduced: 

p v := _ (2.34) 

It is clear that is the momentum canonically conjugate to hy and that 
the components hoo, hoi have vanishing conjugate momenta and are Lagrange 
multipliers for constraints. One can then see that it is possible to set the trace 
part of p v by an action of a time-like diffeomorphism, and set the tracefree part 
hy of hy to be transverse d l hy = 0 by an action of a spatial diffeomorphism. 
Imposing these gauge-fixing conditions, we get the following simple Hamiltonian 

0/(2) ij il /*(2) 

"•EH — P n ij ~ EH" 

«S = i((P*) 2 + (W 2 ), (2-35) 

where now both the metric perturbation and the conjugate momentum are 
symmetric traceless transverse tensors. We do not give all the details of this 
involved analysis; the main point of this discussion being to illustrate how much 
more complicated the case of metric-based GR is when compared with our 
treatment in the previous subsection. We have also verified that exactly the 
same Hamiltonian appears from the usual metric-based analysis and from our 
linearized Lagrangian (2.19). The Hamiltonian (2.35) resulting from the metric- 
based treatment, as well as the fact that the dynamical fields are symmetric 
traceless transverse tensors, is the reason why the graviton is referred to as a 
spin 2 particle. Having derived the same description of the graviton from the 
gauge theory Lagrangian (2.19), we have thus shown that the gauge-theoretic 
description of a spin 2 particle is possible. 
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Having reviewed the usual linearized GR situation, we are ready to discuss 
the question of the relation between our gauge-theoretic and the metric-based 
approaches. We have seen that once all the constraints are imposed and all the 
gauge freedom is fixed, one gets exactly the same Hamiltonian in both cases, the 
one describing two propagating polarizations of the graviton. Using the technical 
jargon, we can say that on-shell, the two descriptions of gravity are the same. 
The question is if they are the same off-shell, i.e. if there exists a held redefinition 
(possibly complicated) that maps (2.19) into (2.32). We will now give arguments 
that prove that there is no such a map, and so the two descriptions are different 
off-shell. 

First, let us do a simple count of the number of components in the fields 
involved. The metric perturbation contains 10 components, while the 
connection contains 12. However, in fact, as we have seen that the Lagrangian 
(2.19) depends only on eight of the 12 components of as it is invariant under the 
shifts (2.21) in the field space. It is certainly not possible to express 10 components 
of the metric perturbation in terms of eight components on which (2.19) depends. 

Another reason why the Lagrangians (2.19) and (2.32) cannot be related is 
that they have very different properties in the field space. It is easiest to see this 
in the case when one passes to the Euclidean signature. Then there is no factors 
of i in the gauge-theoretic description, and the full connection is real. The self- 
dual two-forms in this case are also real. Then the gauge-theoretic Lagrangian is 
just the trace of the square of the real symmetric tracefree matrix P^S^d^c^i. 
As such, it is explicitly non-negative. On the other hand, the Lagrangian (2.32), 
even when continued to the Euclidean signature, is still not positive-definite. 
Indeed, by choosing the conformal mode part h of the metric perturbation to be 
a field quickly changing in space-time one can make the Euclidean Lagrangian 
arbitrarily negative. This is well-known as the conformal mode problem in the 
Euclidean path integral. Thus, we are dealing with very different functionals in the 
two descriptions. In the gauge-theoretic description, the (Euclidean) Lagrangian 
is convex (apart from the fiat directions corresponding to the gauge symmetries) , 
while in the metric-based treatment, the functional is convex in some directions in 
the field space and concave in others (specifically the conformal mode direction). 
The critical point (s) of both functionals are the same, but it is clear that there 
cannot be any map from the field space of one description to that of the other 
that can relate two functionals with completely different properties. Thus, we 
conclude that the above gauge-theoretic description of linearized gravity is off- 
shell different from the description given by the Einstein-Hilbert action. This fact 
will have important implications when we discuss the quantum theory. 

(/) Spinor description 

The above description of the linearized theory, as well as the comparison 
with the metric-based approach, can be simplified quite considerably if one uses 
spinors. The main idea of the spinor description is that the space S + ® S_, where 
S± are the spaces of the so-called unprimed and primed spinors (inequivalent two- 
dimensional representations of the Lorentz group), is isomorphic to the Minkowski 
space. This isomorphism is described by the object 0^ A ' , where we use the 
relativist notations for spinors and denote the spinor indices by upper case Latin 
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letters from the beginning of the alphabet. The spinors X A ' of opposite type to X A 
are called primed. This is in contrast to the particle physics notations where the 
spinor indices are Greek letters, and are referred to as undotted and dotted. The 
(hermitian in our conventions) object 0 AA is called the soldering form. It encodes, 
in particular, the Minkowski metric via = —G aa '6 v aa', where the minus in this 
formula is convention (and signature)-dependent. Using the soldering forms one 
can convert any space-time index into a pair of spinor indices of opposite types. 
Thus, for example, the gauge held a® becomes a\ An where a is the Lie algebra 
index. For more information about spinors and their role in geometry, the reader 
is referred to Penrose & Rindler (1984). 

We now describe the linearized Lagrangian (2.19) in spinor terms. As we shall 
see, most of the properties discussed above become manifest in this description. 
We already know that the space-time index of the connection becomes a pair 
AA' . We then recall that the self-dual two- forms provide an isomorphism 
between the space of self-dual two-forms and the Lie algebra of SO (3), and take 
in the spinor language a very simple form. Indeed, a two-form X^ becomes in the 
spinor description an object Xaa'bb'- This is anti-symmetric under the exchange 
of pairs A A' and BB' . As such, it can be decomposed into either an object that 
is symmetric in AB and anti-symmetric in A'B', or the object anti-symmetric 
in AB and symmetric in A'B'. Using the fact that the only AB anti-symmetric 
object is the metric in the space of spinors €ab, we have: 

Xaa'bb 1 = ^A'B'Xab + (-abX-a'B', (2.36) 

where Xab and Xa>b' are symmetric objects. Then, it can be shown that the 
6a'B'Xab part of X^ v is its self-dual part, and zabXa'B' is the anti-self-dual one. 
The objects S l being self-dual are thus of the form €a'B'^ab- We can then use 
the objects H l AB to identify the Lie algebra index i with a symmetric pair AB of 
unprimed spinor indices. All in all, when all of the indices are replaced by the 
spinor ones, our linearized connection becomes the object a BC aa 1 - 

Thus, we see that the linearized connection a BC aa' is an object taking values 
in <g> S + (g> S- , where the notation is that S" is the space of symmetric rank n 
unprimed spinors. This space can be decomposed into its irreducible components: 

S2®5 + ®5- = S*®5-©5 + ®$_, (2.37) 

where the completely symmetric a( ABC h> in its three unprimed indices part of 
a BC aa' lies in S+ <8> S- and the trace part a EA EA' lies in S+<8)S-. It is then 
easy to see that the part of the connection that can be shifted by the action 
of a diffeomorphism 5%a % =!; a E % is precisely the S + ®S- part. And indeed, 
the linearized Lagrangian (2.19) written in terms of spinors take the following 
extremely simple form: 

4ft~(9iV OT)A ') 2 , (2.38) 

where the numerical coefficient in front is convention-dependent and is not of 
importance for us here. As the symmetrization on all four unprimed indices is 
taken, the above Lagrangian is explicitly independent of the S + ® S- part of the 
connection. It is also easy to check that the Lagrangian (2.38) is invariant under 
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the gauge transformations d^a aa' = &AA'4 > ■ Indeed, this follows from the fact 
that d AA 3^, ~ e AB \3 and thus contains only the AB anti-symmetric part, which 
gets killed by the symmetrization in (2.38). 

It is worth emphasizing how much simpler is the description (2.38) when 
compared with all other known descriptions of linearized gravity. We shall give a 
comparison with the Plebanski-Ashtekar theory in the following subsection. Here, 
we would like to present a brief comparison with the spinor language metric case. 
In the spinor description, the metric perturbation /i M „ becomes Iiaa'BB', which is 
symmetric under the exchange of pairs AA' and BB' . As such, it decomposes 
into an object Haba'B 1 6 <5+ ® S^_, as well as the object tAB^A'B'h proportional to 
the trace of h^ v . Thus, there is 9 + 1 = 10 held components, and the Lagrangian 
(2.32) is a (complicated) function of all these fields. We note that even writing the 
Lagrangian (2.32) in the spinor form produces a rather cumbersome expression 
that is not particularly illuminating. 

We have already remarked that the gauge-theory gravitational Lagrangian 
(2.19) is very similar to the Yang-Mills Lagrangian in the form (2.14). This 
similarity becomes even more striking in the spinor description. Thus, suppressing 
the Lie algebra index of the Yang-Mills held, we can write (2.14) as 

(2-39) 

Thus, the only difference between this Yang-Mills theory and (2.38) is that the 
former uses spin 1 fields in the representation S + <g) S_ (per Lie algebra generator), 
while the latter uses spin 2 fields in the representation S+ <g> S-. Otherwise, both 
Lagrangians are constructed according to the same principle using the Dirac 
operator Qaa'- Indeed, in (2.39), the Dirac operator is used to map 8 : S + <g> S_ — > 
S+, and this symmetric rank two spinor is just squared to obtain the Lagrangian 
(and then the trace over the underlying Lie algebra is taken). In the case of (2.38), 
the principle is exactly the same except that the map is now 9 : <g> S- — >■ , 
as is relevant for spin 2 particles. Below, we shall see that the similarity between 
this description of gravity and Yang-Mills theory extends beyond the linearized 
level, and that interaction vertices in our gauge theory description of gravity take 
very similar form to those familiar from the Yang-Mills. 

Let us now see how the count of the number of degrees of freedom works 
in the metric and the gauge descriptions. The first difference is that the gauge 
theory Lagrangian is a function of only eight fields taking values in <8> 5_ , when 
compared with 10 fields from <8> S£ © (trivial) in the metric case. In this sense, 
the gauge theory description is more economical than the metric-based one. The 
count of the propagating modes is then as follows. From the eight fields that 
the Lagrangian (2.38) depends on, three become Lagrange multipliers for three 
Gauss constraints, which gives the usual two propagating degrees of freedom. 
This is to be contrasted with the metric case count, where four out of 10 metric 
components are Lagrange multipliers for four constraints, which again give two 
propagating modes. The point that we would like to emphasize is that the gauge- 
theory description is less redundant — a smaller number of fields than in the metric 
case is sufficient to produce the same on-shell picture. 

Second, the Lagrangian (2.38) is arguably a very simple and natural construct 
involving the basic field, in comparison with the metric Lagrangian that is not 
even easy to write down with all its four different terms. The final, crucial 
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difference comes from the fact that the Euclidean version of the Lagrangian (2.38) 
is positive-definite (being the trace of the square of a real symmetric tracefree 
matrix) , while the Euclidean version of the linearized Einstein-Hilbert functional 
is not definite (because of the 'conformal' mode h). As we have already discussed 
above, this shows that the two descriptions can only be equivalent on-shell, while 
they are necessarily different off-shell. 

Thus, all in all, the gauge-theory formulation (2.38) seems to be a much more 
economical and elegant description of gravity than the usual metric-based one. 
There is, however, a price to be paid for this simplicity. Indeed, the fields of the 
metric-based description take values in S+ ® S_ plus the trivial representation of 
the Lorentz group. In Lorentzian signature, the representations S + and S- go into 
each other under the complex conjugation. Thus, the representation <8> goes 
into itself under the complex conjugation, which explains why the metric-based 
description works with real fields. In contrast, in our approach, the basic field 
takes values in <8> S- , which under the operation of complex conjugation goes 
to 5£ ® S + , a completely different representation. In other words, our description 
of gravity is necessarily chiral, with gravitons of one helicity being described in 
a different way from those of the other helicity. The chirality also implies that 
the gauge-theory description cannot be by real fields (apart from the cases of 
Euclidean (+,+,+, +) and split (—,—,+,+) signatures where the spinors are 
real objects). As the result, the issue of reality conditions becomes non-trivial. 

(g) A comparison with the Plebanski-Ashtekar gauge-theoretic description 

We close this section with a comparison between the above gauge theory 
description and that owing to Plebanski (1977) and (in the Hamiltonian 
formulation) Ashtekar (1987). A detailed description of the corresponding 
linearized theory is available in (e.g. Ashtekar (1991), ch. 11). Here, we will 
only give a very brief comparison, for more details, the reader is referred to the 
above references. 

In Plebanski-Ashtekar description, one starts from a certain action functional 
that depends on an SO(3) connection A 1 ^, as well as an su(2) Lie algebra-valued 
two- forms field B 1 ^, as well as a set of Lagrange multipliers. The precise form of 
this functional is not important for us here. What matters is the fact that it is 
not a functional of the connection only, as in our description, in that it depends 
on many other fields. In the Hamiltonian description many of these fields become 
non-dynamical. Thus, as usual, the temporal part of the connection becomes the 
Lagrange multiplier for the Gauss constraint. The temporal part B^ of the B-field 
contains nine components, of which five are set to zero by constraints obtained by 
varying the action with respect to the Lagrange multipliers, and the remaining 
four are the Lagrange multipliers for the diffeomorphism constraints. The spatial 
part bL ~ ejki^ h of the two-form field plays the role of the momentum conjugate 



to the spatial connection. The linearized theory constraints are then as follows: 

(),7T • () (Gauss), 



and 




0 (Diffeo) 

0 (Hamiltonian), 



(2.40) 
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where a\ is the spatial connection. The analysis of the propagating mode content 
of this system is quite non-trivial (it can be found in Ashtekar (1991), ch. 11), 
with the outcome being that the usual symmetric traceless transverse tensors are 
the propagating degrees of freedom. 

Let us now compare the above with our gauge-theoretic description. The 
first difference is that our Lagrangian is a functional of the connection only, 
with no other auxiliary fields present. The second (crucial) difference is that 
the diffeomorphisms are realized in our description in a completely different 
way. Indeed, the spatial diffeomorphism as well as the Hamiltonian constraints 

(2.40) are non-trivial, and follow from varying the Plebanski action with respect 
to certain components of the temporal part of the B-field. In contrast, in 
our description, there are no fields in the action varying with respect to 
which one gets the diffeomorphism constraints. Instead, the action is simply 
independent of certain components of the connection. At the level of the 
Hamiltonian description, this is described by the statement that the following 
constraints hold: 

tt [ ^ = 0 (Diffeo) , tt" = 0 (Hamiltonian) . (2.41) 

As we have already said, these follow from impossibility to solve for certain 
components of the velocity in terms of the momenta, and not by explicitly varying 
the Lagrangian with respect to some Lagrange multipliers. What the constraints 

(2.41) generate are of course simple shifts of the connection variable in some 
directions. As a result, the analysis of the propagating mode content of (2.19) is 
much simpler than the corresponding analysis in the Ashtekar case. 

All in all, our gauge-theory description is considerably different from the 
Plebanski-Ashtekar formulation, with both the linearized Lagrangian and 
Hamiltonian formulations being much more compact in our case. It is also 
worth emphasizing that the Plebanski Lagrangian, being essentially the Einstein- 
Hilbert Lagrangian with some extra fields added, is non-convex, while the 
(Euclidean) linearized Lagrangian (2.19) is explicitly convex. Thus, there are clear 
benefits from using the description (2.19) when compared with the Plebanski- 
Ashtekar formulation. 

Finally, we note that our Lagrangian (2.19) can be obtained from that of 
Plebanski theory by the process of integrating out the two- form and the Lagrange 
multipliers fields, as was explained in Krasnov (2011 b), and then considering the 
linearization of the resulting action, as will be explained below. So, Plebanski 
formulation of GR and the formulation that is the subject of this review are not 
unrelated. However, we will refrain from explaining this link in detail, as this will 
take us too far from the main subject of this paper, which is how to describe 
gravity by a theory with a connection as the only dynamical field. 



3. Diffeomorphism-invariant gauge theories 

In this section, we finally explain where the postulated above linearized gravity 
Lagrangian (2.19) comes from. We will also explain how both gravity and Yang- 
Mills theory arise from a single source — a general diffeomorphism-invariant gauge 
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theory. In this section, we switch gears completely, and follow a top-to-bottom 
approach, where we first present a general principle, and then specialize to the 
case of gravity plus Yang-Mills. 

Some historical remarks are in order. While the gauge theories we encounter 
in this section are relatively new — in the form described here they have been 
introduced in works of the present author — they are intimately related to 
certain generally covariant theories of connection that has been discovered and 
studied in the early 1990s. In particular, the constructions of this section will 
make it clear that GR is not the only interacting theory of massless spin 2 
particles. This fact has been known for quite some time in a different, but 
not unrelated formulation. The history of these developments is (briefly) as 
follows. It was realized in work (Capovilla et al. 1989) (see also Capovilla et al. 
1991) that the zero cosmological constant GR can be reformulated as a theory 
of an SU(2) connection. The Lagrangian of this description, however, contains 
not just the connection held, but also an additional auxiliary held of density 
minus one. It was then quickly realized that in this formulation GR is not 
unique. Thus, a two-parameter family of 'neighbours' of GR was introduced 
and studied in Capovilla (1992), and then an infinite parameter of theories 
constructed along the same lines was proposed in Bengtsson (1991). All these 
theories are not ones of 'pure connection', as we have already said, they contain 
an additional auxiliary field. But they all have the key property that they 
describe just two propagating polarizations of the graviton. The same theories 
were rediscovered (in a different formulation) in Krasnov (2006) and this new 
version was related to earlier developments in Bengtsson (2007). The novelty of 
the approach followed here is that Lagrangians that are used are functions of 
only the connection field, with no auxiliary field being necessary. However, our 
theories can be though of as those studied in the 1990s with the auxiliary field 
integrated out. 



Let us start by describing how gauge theory actions can be constructed in 
the absence of any space-time metric. This question is non-trivial, for up to 
now, we only know how to formulate a gauge theory (i.e. Yang-Mills theory) 
when a space-time metric is available (actually, since Yang-Mills is classically 
conformally invariant , only a conformal class of the metric is needed) . It is not at 
all clear how the space-time indices of the curvature two-forms can be contracted 
in the absence of any metric. 

To begin to answer this question, we note that the only object that can be used 
to contract the indices of copies of F® v and that is available without a metric is 
the tensor density V Lvpa . This is a completely anti-symmetric object that in any 
coordinate system has components ±1. We thus see that the following object with 
all space-time indices contracted is available: 



where we have explicitly indicated that X ab is a density weight one (by putting 
a tilde over the symbol). The numerical prefactor of 1/4 is introduced for 



{a) A class of gauge theories 
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convenience so that with our conventions F a A F b = X ab d^x. We now construct 
our Lagrangian by considering the most general gauge-invariant quantity that 
can be produced from (3.1). 

Thus, the task is now to construct a scalar from (3.1). Moreover, this scalar 
must be a density one, so that it can be integrated over the space-time to produce 
an action. Thus, what we require is a function from the space g ®s 9, where g is 
the Lie algebra of G and S denotes the symmetrization, to the real (or possibly 
complex) numbers, and this function must be homogeneous of degree 1: 

/: 0 ® 50 ^IR(C), f(aX) = af(X) VIeg%g,VaeC*, (3.2) 

where the last condition comes from our desire for f(X) to be of density 
weight 1. We also want our action to be gauge- invariant , and so as the gauge 
transformations act on the curvature two-forms by rotating the Lie algebra indices 
a (using mathematical terminology, the adjoint action) , we also require 

f( a d g X)=f(X) VXeQ® S Q,VgeG, (3.3) 

where ad g X is the extension of the adjoint action of G in g to g (g>s g. 

One can then easily convince oneself that the class of such functions is not 
empty by giving examples. The first obvious example is obtained by taking 
f(X) = Tr(X). However, it is easy to see that the corresponding Lagrangian is 
a total derivative, and thus does not give rise to an interesting dynamical theory. 
A more non-trivial example is f(X) = Tr(X 2 )/Tr(X), which is homogeneous of 
degree 1 as required and gauge-invariant. In general, one can construct a very 
large set of functions with desired properties by taking all possible (independent) 
invariants constructed from X e g <g>s g, and then taking an arbitrary function 
of homogeneity degree zero ratios of such invariants, and multiplying it by, say, 
Tr(X) to get the required total homogeneity degree 1. For our purposes, here it is 
enough to know that a very large class of functions with required properties exists. 

Having discussed the properties of functions that can be used in constructing 
the action, we are ready to write down an action principle for a diffeomorphism- 
invariant gauge theory. Using the form notations, we have 



S[A] = i 



f(F A F), (3.4) 



where it is understood that the wedge product of two curvature two-forms 
is 4-form valued in g (8)5 0, and the properties of function / make f(F A F) 
a well-defined 4-form that can be integrated over the space-time manifold to 
get the action. The imaginary unit i is introduced in front of the action for 
future convenience. 

For future use, we note that the held equations following from (3.4) when this 
action is varied with respect to the connection are: 

d A (-¥-F b \=0. (3.5) 

Here d A is the covariant exterior derivative with respect to the connection A, 
and df/dX ab is the matrix of partial derivatives of the function / with respect 
to the components of its matrix argument X ab . We note that the matrix of 



Proc. R. Soc. A (2012) 



2148 



K. Krasnov 



first partial derivatives is a function of X that is homogeneous of degree 0. 
It is worth emphasizing that the held equations (3.5) are (nonlinear) partial 
differential equations that are second order in derivatives. The fact that only 
second-order differential equations appear is reassuring, for higher order held 
equations typically lead to instabilities. 

It is instructive to compare (3.5) with the held equations following from 
the Yang-Mills action functional. These read d^F" 1 = 0, where the dependence on 
the space-time metric enters via the Hodge star operation applied to F. Thus, the 
equations (3.5) are similar to those of the Yang-Mills theory, except that instead 
of applying to the curvature the Hodge star operator (which does not exist in our 
case), one contracts it with a matrix of first derivatives of the function /, which 
is itself depending on the curvature. Thus, equations (3.5) are well-defined in the 
absence of any space-time metric. 

Importantly, we note that there are no dimensionful constants involved in the 
definition of the theory (3.4). Indeed, it is natural to take the dimensions of the 
connections to be those of 1/L, L being length (or, using the standard terminology 
mass dimension 1). The quantities X ab are then of mass dimension 4, and owing 
to the homogeneity of /, so is the Lagrangian. Thus, there are only dimensionless 
constants involved in the construction of the Lagrangian of our theory, and these 
are hidden as the parameters of the function /. 

Our final remark is about the generality of the above construction of a 
diffeomorphism-invariant gauge theory. We have considered actions that are built 
from the quantities (3.1). The natural question to ask is if there are any more 
involved gauge and diffeomorphism-invariant functionals of the connection that 
can be constructed. As we shall discuss in more detail below, this question is 
very important for understanding of the quantum theory based on (3.4), for it 
is related to the question of how the class of theories (3.4) behaves under the 
renormalization. To discuss this question, we note that a certain 'metric' tensor 
can be produced out of the curvature two-forms via: 

9nv = ^ yS 9ab C^F^ a F^F yS . (3.6) 

Here C® d are the structure constants, and g a b the Killing-Cart an metric on the 
Lie algebra q. The object is a tensor density of weight 1, symmetric under the 
exchange of [iv. It is a general group G analogue of the so-called Urbantke metric 
(Urbantke 1984). The choice of the proportionality coefficient in this formula 
is rather arbitrary, and one can, for example, always divide the object (3.6) by 
Tr(X) to get an object of density weight 0. So, in general (3.6) is only defined 
modulo rescalings. However, what is important for us here is that this object 
exists. In general, it is a non-degenerate tensor, which thus has an inverse. This 
inverse can then be used to contract indices of the operator of the covariant 
derivative d^n with that of the curvature F® a , producing a Lie algebra- valued 
object with one space-time index. This object can in turn be used to construct 
gauge and diffeomorphism-invariant objects of the type different from those 
considered before. Thus, the class of theories (3.4) describes only a subset of 
diffeomorphism-invariant gauge theories. This subset can be characterized by 
noting that for any actions other than (3.4), the resulting field equations are 
higher than second-order in derivatives. Thus, the class (3.4) consists of those 
diffeomorphism-invariant theories that lead to not higher than second-order field 
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equations. It is therefore certainly justified to restrict one's attention to such 
theories at low energies. A discussion of issues arising at high energies, when also 
the quantum effects become important, will be given below. 

We would like to close this subsection with two references on somewhat 
analogous constructions of action functional that have appeared in the literature. 
Thus, Hitchin (ePrint) presents an action that is a functional of a p-form in 
n-dimensions. Such an action is constructed as a homogeneous of degree n/p 
function of a form. The result is a well-defined n-form that can be integrated 
to produce a 'volume' functional. The difference with our case is that we 
have constructed an action from Lie algebra-valued 2-forms (in four space-time 
dimensions), while in Hitchin (ePrint), non-trivial functionals arise for higher 
rank forms (3- and 4-forms) in higher dimensions. The forms in Hitchin (ePrint) 
are usual differential forms, and not forms with values in vector bundles as in 
our context. 

Another relevant construction is Endlich et al. (2011), where an action principle 
for a perfect fluid is given in a form quite analogous to our (3.4). Here, one also 
works with symmetric matrices valued in some internal space, and constructs a 
gauge-invariant function of such matrices which is then integrated to produce an 
action. In addition, the requirement of invariance under the volume-preserving 
diffeomorphisms is imposed. The main difference with our context is that our 
basic held is a connection one-form, while in Endlich et al. (2011), it is a space- 
time scalar with values in some internal space. The perfect fluid construction 
of Endlich et al. (2011) suggests that the elementary excitations of our theory — 
gravitons — can be interpreted as analogues of phonons. Indeed, our gravitons arise 
as gapless modes once the original symmetries of the theory — diffeomorphisms 
and gauge rotations — are broken to a smaller (Poincare) subgroup by the 
background. We now turn to describing the linearization of (3.4) around an 
appropriately chosen background. 

(b) Background 

We start by noting that the construction (3.4) of a diffeomorphism-invariant 
gauge theory is empty in the simplest case G = U(l). Indeed, with the Lie algebra 
being one-dimensional, there is no other independent invariant of X ab except the 
trace (which in this case coincides with the quantity itself), and so the only 
possible theory in that case is the topological one (with the action being a total 
divergence). The next to most simple case is G = SO (3), which we shall now see 
describes gravity. 

We will first explain how the Lagrangian (2.19) is obtained from (3.4) when 
the theory is linearized around a specific background. We will see that for 
this calculation, the specific form of the function / defining the theory is not 
important. Any choice of / satisfying a certain weak non-degeneracy condition 
will produce (2.19) via linearization. Then, in the next subsection, we describe 
how a specific choice of / gives GR. 

We now need to discuss the background connection to be used for the 
linearization. We present a general description valid for any gauge group G, and 
then later specialize to the case G = SO(3). We first note that because ratios 
of invariants of X ab are used, we cannot take the zero (flat) connection with 
vanishing curvature. Indeed, the ratios of curvature invariants that are arguments 
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of the function / are then ill-defined. This is probably the most significant 
difference with the case of Yang-Mills theory, where the theory is completely well- 
defined around the zero connection. In our case, to obtain an expansion around 
the Minkowski background, there is no other choice but to resort to some sort of 
limiting procedure. A natural limiting procedure is to take a constant curvature 
connection and then send the radius of curvature to infinity, thus reproducing the 
Minkowski space-time. We shall see that this is a well-defined procedure with an 
unambiguous outcome. Alternatively, one can interpret this limiting procedure 
as that of finding the linearized description of the theory (3.4) for a constant 
curvature background, on scales much smaller than the curvature scale. We note 
that this is a physically realistic situation, because the metric of our Universe 
is not Minkowski, and, during the current late stages of the evolution of the 
Universe, is in fact quite close to the de Sitter space. 

To describe the background connection, we shall require it to have a large 
degree of symmetry, so that it can legitimately be referred to as a 'vacuum' state 
of our theory. The symmetry requirements that we impose are those standard 
in cosmology: homogeneity and isotropy. Thus, we require that there exists a 
foliation of the space-time such that on each three-dimensional slice of the 
foliation, the connection is homogeneous, i.e. looks exactly the same at every 
point. This is formalized by introducing a time coordinate, which we for now shall 
refer to as 77, so that 77 = const, are the time-slices of homogeneity. The connection 
will then only be allowed to depend on 77. We then require the connection to be 
spherically symmetric, i.e. such that the effect of spatial rotation around any 
point on 77 = const, hypersurface can be removed by a gauge transformation. It 
is not hard to show that this fixes the connection to be of the form 

a a 

^backgr = (3.7) 

where x % are the usual M 3 coordinates on 77 = const, surfaces, a® is a function of 77 
only, and we have used the availability of time-dependent gauge transformations 
to eliminate the dw components. The requirement of spherical symmetry further 
requires af to be an embedding of the Lie algebra so(3)~s/(2) of the group 
of spatial rotations SO(3) into the Lie algebra q, i.e. a map that sends sl(2) 
commutators into Q commutators. We refer the reader to the study by Krasnov 
(2011c) for a demonstration of this claim. In the case G = SO(3), there is only 
one such (non-trivial) embedding, and so in this case a- is proportional to <5j, but 
in the case of a larger gauge group G, there are typically several inequivalent (i.e. 
non-conjugate in G) embeddings. Later, we shall see that it is very interesting to 
consider different embeddings in (3.7), as one gets very different particle spectra 
around non-equivalent 'vacua'. 

Let us now rewrite the connection components a? as 

o? = e?o(ij), (3.8) 

where e° is an embedding of si (2) into g normalized so that /"fee^ = e'yi-ef, and 
a{rj) is an arbitrary function of time 77. We shall now see that the only freedom 
in choosing the vacuum (3.7) is the choice of the embedding, and that the time 
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coordinate can always be conveniently chosen so that the dependence of a on 
time is fixed. Thus, let us compute the curvature of (3.7). We have 

F a = -a 2 el [\°^ d V A dx l + dx 3 A dx k ^j . (3.9) 

We now choose the time coordinate so that a'/a 2 dr] = dt, which fixes the 
connection component a as a function of t: 

a(t) = - i — t<to, (3.10) 
to — t 

where to is an arbitrary integration constant. 

Let us now inspect the background curvature (3.9) more closely. The expression 
(3.9) shows, in particular, that in the six-dimensional space of all two-forms, the 
background curvature components span a three-dimensional subspace 

A+ :=Span(F a ), dim(^I + ) = 3. (3.11) 

We can now introduce a metric, or rather a conformal class of metrics, by requiring 
this subspace A + C A 2 to be that of self-dual two-forms with respect to the metric. 
It is known that this requirement fixes the metric modulo conformal rescalings. 
Let us stress that we are not forced to introduce a metric, as our theories are 
perfectly well-defined and can be studied in the absence of one. However, as 
all the physics as we know it happens in the metric background, it is very 
convenient to introduce a metric so that later, we have a chance to see the familiar 
physics arising. We are not yet claiming that the metric introduced via the above 
mathematical construction is a physical one, in the sense that it is the one that 
the matter fields feel. However, we shall later see that this is indeed the case. 

Let us now discuss the ambiguity in the choice of the representative in the 
conformal class defined by declaring A + to be the self-dual two-forms. Given 
any such metric, let 6 1 be the corresponding tetrads: ds 2 = 8 1 ® 0 J rju, where 
I , J = 1, ... ,4 and r]jj is the Minkowski metric. We can then define the 'canonical' 
self-dual two-forms for this metric via 

S i :=i0°A0'+ ISjkP Ad k . (3.12) 

For metrics related by a conformal rescaling g^ v —¥ Q 2 g^ v , the corresponding self- 
dual two- forms S l are related via S l — >■ Q 2 I<^ V . Thus, any metric whose self-dual 
two-forms 2 are a multiple of 

£^ ink := idt A dx l + \ij k dx 3 A dx k (3.13) 

is a possible metric in the conformal class fixed by (3.11). It is, however, very 
convenient to work with a metric such that 

F a = -M 2 e*Z\ (3.14) 

where M 2 is a constant (i.e. time-independent) parameter. This is the de 
Sitter metric 

ds 2 = c 2 (t) {-dt 2 + ^(da;') 2 ^ , (3.15) 
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of constant curvature M 2 



or of the cosmological constant 




(3.16) 



Here 



c(t) = 



1 



(3.17) 



M(to - t) " 



One could also consider anti-de Sitter space if one so wished by taking the 
parameter M to be purely imaginary. However, as the observational evidence 
points to the existence of a positive cosmological constant, we choose our 
background metric to be that of a de Sitter space. 

The introduced parameter M of dimensions of mass is completely arbitrary, 
for one can always rescale the metric by a constant, while at the same time 
rescaling M 2 by the inverse constant without changing the background curvature 
F a . We now remind the reader that our theories do not have any dimensionful 
constants. We will later see that once (3.4) is expanded around backgrounds (3.7), 
interpreted as equipped with a metric of constant curvature M 2 , the dimensionful 
parameter M serves as a seed for all other dimensionful parameters in the theory, 
such as masses (for some massive fields that will be discussed below) , and such as 
dimensionful coupling constants (such as e.g. the graviton interaction strength). 
Thus, the parameter M is in fact just a unit of mass for our theories. As such, 
it is inconsistent to ask about its value, for there is nothing else with which this 
value can be compared. 

All our backgrounds (3.7) are constant curvature, in e.g. the sense of equation 

(3.14) . Thus, the most symmetric solution of our theory (3.4) is naturally a 
Universe with a cosmological constant (in the sense of the de Sitter metric 

(3.15) ). In this sense, a non-zero cosmological constant need not be explained 
in our formulation of gravity, it comes as a given. Moreover, as we have already 
stressed above, it is logically inconsistent to ask about its value, for being the only 
dimensionful parameter and that there is nothing with which this value can be 
compared. The cosmological constant is just a unit in which all other quantities 
get expressed. 

The above discussion has very interesting implications. In our theories, the 
cosmological constant A, together with the usual c, h, becomes the fundamental 
constant, such that any other constant of Nature can be expressed as a multiple 
of some combination of A, c, h. Thus, for our class of theories, the cosmological 
constant A plays the same role as the Newton constant plays conventionally 
In fact, we shall see that the Newton constant that measures the strength of 
interaction of gravitons does get expressed as a multiple (necessarily very large) 
of M~ 2 . When put in this way, the famous cosmological constant problem of why 
A is so small becomes in our case the question of why M 2 ~ 1/G is so large (when 
compared with A). We will come back to this question below. 



Before we proceed with our analysis of the theories (3.4) linearized around the 
background (3.7) we make a small detour and compute the value that the action 



(c) Action evaluated on the background 



Proc. R. Soc. A (2012) 



Review. Gauge-theoretic approach to gravity 



2153 



takes when evaluated on (3.7). This small computation is related to the issue of 
other physical parameters being expressed in terms of the cosmological constant 
discussed above. 

We have F a A F b = M 4 e^e b S i A W = 2iM 4 e? e) \f=~g d 4 x , where j=g= c\t) 
is the square root of the determinant of the de Sitter metric. So, we have 

r 

SUbackgr] = -2M 4 f{m) 

where we have introduced a notation m ab := e?e*5^ for the pullback of the metric 
on sl(2) into g by the embedding ef. Comparing this with the value of the 
Einstein-Hilbert action evaluated on the constant curvature metric, we see that 
we must identify 

2M i f(m) = ^. (3.19) 

In other words, the product M 4 f(m) must be identified with the energy density 
of the cosmological constant. Now, using the above identification (3.16), we see 
that we must have 

/(m)~ M 2 p /M 2 . (3.20) 

This is a very large number M 2 /M 2 ~ 10 120 . Below, we shall see similar large 
numbers appearing if we want to get the usual graviton interaction strength 
from our approach. Thus, we learn that the functions / that are capable of 
reproducing the realistic physics are quite special in the sense that the above sort 
of scales appear. We will come to this point when we discuss what the cosmological 
constant problem becomes within our approach. 




(d) General linearization 

To see how (2.19) arises, let us first derive the general linearization formula, 
without specifying the G = SO(3) case. Thus, we will continue to use a,b,... 
for the Lie algebra indices. Now, let A"^ be (an arbitrary for now) background 
connection and bA a be a perturbation. The first variation of (3.4) reads: 



5S = 2i 
The second variation reads: 



9/ 

dX ab 



F a A D A 5A b 



(3.21) 



8 2 S = i 



a 2 / 



dX ab dX cd 
9/ 



(F a A D A 8A b )(F c A D A 5A° 



+ 2^—(D A 5A a A D A 5A b + F a C bcd dA c A 8A d ). 



dX 



(3.22) 



Here C abc are the structure constants. One must substitute the background 
curvature (3.14) (as well as the background connection) into this second-order 
action, and study the spectrum of the arising excitations. 
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(e) Case G = S0(3): linearized theory 

We now note that in view of (2.7), the background value of the matrix X VJ is 
proportional to 5' K > . Using this fact, we can say a great deal about the matrices 
of first and second partial derivatives of / present in (3.22). First, it is clear that 
the matrix of first derivatives can only be proportional to 5'K To see this, one 
can think about the function f(X) as being a power series expansion in traces 
of powers of X, as well as inverses of such traces. It is then clear that the value 
the matrix of first derivatives of such a function can take on <5 ij must itself be 
proportional to 8 V . The proportionality coefficient is then a constant (because 
the background was chosen to be of constant curvature). We can then integrate 
by parts in the first term on the second line in (3.22) and, assuming that the 
connection perturbation vanishes outside of the region of interest, see that the 
result cancels with the second term. So, for the background considered, we can 
ignore the second line in (3.22). 

Let us now consider the term involving the matrix of second derivatives of /. 
The form of this matrix can be determined from the fact that / is a homogeneous 
function of order one. This property can be summarized by writing 

-^Lx ij = f. (3.23) 
dXv 

This identity can be differentiated with respect to X kl with the result being 

9 2 / 



QXvQX kl 



X l] = 0. (3.24) 



When the background value of X %] is proportional to 5 l -\ this implies that the 
matrix of second derivatives of / gives zero when contracted with the Kronecker 
delta in any of its pairs of indices. This implies that the matrix of second 
derivatives is proportional to the projector on the symmetric tracefree tensors: 



a 2 / 



(I.V o.v 



= -lp® u . (3.25) 



X=5 ^ 



where g is some constant, capturing the content of the Hessian of the function / 
at = <5 i? , and the numerical factor and the sign are introduced for convenience 
(we will later see that g is positive for GR) . Note that g = 0 for the function 
f(X) = Tr(X) corresponding to the topological theory, but for a generic choice of 

/, it is non-zero. The Hessian at the point X' 7 = 2iM\J—gd^ is easily determined 
using the homogeneity of /. All in all, collecting all the factors and dividing the 
second variation of the action by two to get the linearized action we get: 



S (2) = 9 
2 



f=-gP^\ZfD,oA V] ){X p °D p 5Aoi)- (3-26) 



Note that factors of M have cancelled. Now, to get the Lagrangian (2.19) we define 
a new, canonically normalized field a^ = iy/gdA 1 . Note that there is a factor of the 
imaginary unit in this formula, as in e.g. (3.7). We now take the limit M — > 0, or, 
alternatively, consider the above linearized theory on scales much smaller than 
the scale of the curvature. In both cases, we can replace the covariant derivative 



Proc. R. Soc. A (2012) 



Review. Gauge-theoretic approach to gravity 



2155 



present in (3.26) by the ordinary derivative, and remove the volume density for 
the metric. We get: 



P^ k \^ v ^a vj ){^d p a al ), 



(3.27) 



which is the integral of the Lagrangian (2.19) studied above. 



(/) Case G = SO(3): recovering full general relativity 

We have just seen how for G = SO(3), the linearization of (3.4) around 
the constant curvature background (3.7) produces the gauge-theoretic gravity 
Lagrangian (2.19) studied in the previous section. This has happened 
independently of which function / was taken in the construction of the action. We 
shall now show that for a particular choice of /, the full on-shell content of (3.4) 
is the same as in GR. In other words, we will now show that for a specific /, there 
is a correspondence between solutions of GR and solutions of theory (3.4). As we 
have already discussed in the section on the linearized theory, we can at best hope 
to have an on-shell correspondence between the two descriptions, for the off-shell, 
even the linearized theories have completely different convexity properties. 

Let us first state that the function / that corresponds to GR is given by Krasnov 
(20116): 

where G is the Newton's and A is the cosmological constant. Note that, 
as discussed above, there are only dimensionless constants entering into the 
construction of /. We also note that the function / evaluated on the identity 
matrix X iJ = <5 - ? is of the order M 2 /M 2 , which we knew we should have expected 

from general considerations leading to (3.20). We also note that the cosmological 
constant A enters / in the denominator, so the function / as well as the action 
blow up in the limit A— > 0. Thus, strictly speaking, only the non-zero A case 
GR is covered by our gauge-theoretic description. However, the flat case can 
be obtained by a limiting procedure of the sort we have used in the previous 
subsection. 1 We feel that it is not a drawback of our scheme that only the non- 
zero A case GR admits an honest action principle. Indeed, it is now commonly 
believed that there is a small cosmological constant in the Universe (or at least all 
the observational data are compatible with the dark energy being a cosmological 
constant). Thus, the action principle with / as above is sufficient to describe 
the physical Universe. Also, as we have said, if desired, the properties of the 
flat Universe can be recovered by a limiting procedure. In addition, as we shall 
indicate below, the present gauge-theoretic formulation may allow to explain the 
form of the 'physical' function / as an outcome of some sort of renormalization 

1 A 'pure connection' action functional that covers the A — 0 case is also possible, and is known 
as the Capovilla-Dell-Jacobson (CDJ) action (Capovilla et al. 1989). It is, however, not of the 
type (3.4) as it contains an additional auxiliary field. The CDJ action can be thought of as the 
(5-function imposing the condition that the trace of the square root of the matrix of wedge products 
of the curvature vanishes. The same condition follows from (3.28) in the limit A — > 0, and in this 
sense the A = Q case is covered by our description via a limiting procedure. 
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group flow. Thus, the present approach may have something to say about the 
problem why A is so small — the famous cosmological constant problem. More 
remarks on this will be given below. 

The notion of the square root used in (3.28) requires clarifications. As it 
can be shown, (see Krasnov 20116) for more details, in the case of Euclidean 
signature GR, the matrices X that arise are squares of real matrices and therefore 
are real and not indefinite, i.e. all their eigenvalues are of the same sign (with 
some possibly being zero). For such matrices, there is a well-defined notion of 
the square root, and this is what is used in (3.28). In the case of the physical, 
Lorentzian signature GR, the situation is more complicated, because the matrices 
X are in general complex. However, even in this case, we can define the square 
root in the neighbourhood of the identity matrix by writing \[X = ^5 + (X — 5) 
and expanding in powers of the small matrix X — 5. This definition is sufficient 
for most practical applications, and certainly for the perturbative treatment of 
gravity. Thus, one can take the position that function (3.28) can be defined 
by a power series expansion for curvatures whose departure from the constant 
curvature case is not large. And in the large curvature case, where a perturbative 
definition would break down one cannot trust the theory of GR anyway, as higher 
order corrections to the GR Lagrangian (induced by e.g. quantum effects) must 
be taken into account. To summarize, the function (3.28) is well-defined in all 
situations where one is interested in classical GR predictions. 

To show that the theory with (3.28) is on-shell equivalent to the usual GR 
with its Einstein-Hilbert action, let us first write down the field equations (3.5), 
specialized to the case of the defining function / given by (3.28). Let us first 
define, for any / 

B l := —l-F ] . (3.29) 
dXv 

The field equations (3.5) are then (IaB 1 = 0. For the function (3.28), we have 

-S- = l —(Tiy/x~){X- 1 l' 1 ) i \ (3.30) 

where a negative power of the matrix X v should also be understood in the sense 
of a power series expansion. It is then easy to check that precisely for the function 

(3.28) , we have the following important identity satisfied by the two-forms B % : 

B l A B 3 ~ fl«". (3.31) 

One can reverse this argument and say that the function / in (3.28) is chosen 
precisely in such a way that the identity (3.31) holds. This identity is analogous 
to (2.7) already encountered above. 

Overall, we get the following structure. First, any SO(3) connection A 1 gives via 

(3.29) a set of two-forms B' satisfying (3.31). It is then known that such a triple of 
two-forms defines a unique space-time metric g^ v . The metric g^ v is determined by 
the requirement that the triple of two-forms B l is self-dual with respect to g^ v (this 
defines the metric modulo conformal transformations) , and that the volume form 
of the metric coincides with a multiple of Tr(5 A B). For an explicit expression, 
one can use the Urbantke formula (3.6), specialized to the case q = su(2), with 
the two-forms B l substituted in place of F l . Then for a connection satisfying its 
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field equation d^B 1 = 0, it can be shown that A 1 coincides with the self-dual part 
of the Levi-Civita connection for the metric defined by B'. Finally, the defining 
relation (3.29) can be read as saying that the curvature of the self-dual part 
of the Levi-Civita connection is self-dual as a two-forms, which is known to be 
equivalent to the Einstein condition, see proposition 2.2 in Atiyah et al. (1978), 
or Krasnov (2009) for an exposition oriented towards a physics audience. This 
establishes the on-shell equivalence of theory (3.4) with (3.28) with Einstein's 
GR. As we have already discussed, there is only the on-shell equivalence, i.e. one- 
to-one map between the spaces of solutions of field equations. Off-shell behaviour 
of the functionals (3.4) with (3.28) and the Einstein-Hilbert functionals are very 
different. See the study of Krasnov (20116) for a more detailed discussion of the 
equivalence of the two theories. 

Let us also discuss the relation in the opposite direction. Here one takes an 
Einstein metric and then computes the self-dual two-forms E l given by (3.12), 
which satisfy B l A B 3 ~ 5 V . One also computes the self-dual part A* of the spin 
connection. It can then be seen that these objects satisfy oIaB 1 = 0, as well as an 
equation F % = W ij B 3 , where W lJ are arbitrary coefficients, with the only condition 
being that Ti(W) ~ A. This shows that the self-dual part of the Levi-Civita 
connection is our sought A\ which satisfies all the relevant field equations. The 
relation between the two descriptions can be made much more explicit, but here 
we will refrain from going into more details. Some more details will be given in 
the next section where we discuss the quantum theory and, in particular, the 
graviton scattering. 

(g) Deformations of general relativity 

Let us now discuss what happens when the function / used in the construction 
of the Lagrangian is different from (3.28). As we have already mentioned, when 
f(X) = Tr(X), we get a theory without any propagating degrees of freedom. As 
we have also seen above via the linearization, for any generic choice of /, one 
gets a theory describing a massless spin 2 particle. Thus, choices of / different 
from (3.28) give rise to interacting theories of massless spin 2 particles that are 
different from GR. Thus, it appears that for a generic choice of /, the action 
(3.4) with G = SO(3) gives rise to a counterexample to the GR uniqueness 
theorems, which are often quoted as saying that GR is the only consistent 
theory of interacting massless spin 2 particles, see e.g. the very first paragraph of 
Dvali et al. (2011). The purpose of this subsection is to explain why the above 
construction of the gauge-theory gravitational theories in no way contradicts the 
known uniqueness theorems. 

Let us start with a brief review of the GR uniqueness results. This review 
cannot be exhaustive, as unavoidably only the results known to the author 
will be mentioned. Historically, the first type of uniqueness results is based on 
an old idea that the nonlinearities of gravity can be reconstructed from the 
fact that the gravitons must couple (in a canonical way) to their own stress- 
energy tensor. This line of reasoning is best-known from the description given 
in Feynman et al. (1995). Another well-known uniqueness result is Hojman 
et al. (1976), which works at the Hamiltonian level and analyses the algebra of 
constraints of GR. Another approach to GR uniqueness is based on the graviton 
scattering amplitude kinematical constraints (Grisaru et al. 1975). Finally, the 
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last well-known approach that we mention is due to Wald (1986), and analyses 
the conditions for the infinitesimal gauge symmetries visible at the linearized level 
to integrate into a gauge symmetry of the nonlinear theory. 

We now discuss in turn each of the methods for proving the GR uniqueness. Let 
us start with the derivations that use the fact that the graviton held h^ v couples to 
its own stress-energy tensor. In more details, the idea here is that knowing that 
the spin 2 held /i M „ couples canonically to its own stress-energy tensor (which 
is computed from the linearized action), one can hope to reconstruct the full 
nonlinear action of gravity. This was achieved (to some extent) by Feynman 
et al. (1995), and a much cleaner argument using the first-order formalism was 
given by Deser (1970). A similar argument fixes the coupling of any type of 
matter to gravity. We note that even in this case, there are some controversies 
and ambiguities, as was recently emphasized by Padmanabhan (2008). Our next 
comment is that this type of derivations assume that the only way to describe 
a spin 2 particle is via a field with a symmetric pair of space-time indices. 
As we have seen, this is not at all the case, and it is equally possible to describe 
spin 2 particles using a connection variable a 1 , where z = 1,2,3 is an SO(3) index. 
And the proofs based on the graviton descriptions are simply inapplicable to 
the connection description. 

One could then ask a question if a similar, not same, logic can be applied in the 
connection case. However, in this case, we are unable to say that the connection 
a 1 ^ couples to its own stress-energy tensor (as could be derived from the linearized 
Lagrangian). Indeed, there is simply no notion of the stress-energy tensor in our 
context (as there is no metric, and so, no notion of the variation of the action 
with respect to the metric). The best one could hope for is that the connection 
a* couples to some current J* that can be obtained (in some canonical way) from 
the linearized Lagrangian. One could then iteratively determine the nonlinearities 
and completely fix the theory. However, we do not see any physical requirement 
for why the connection should couple to a certain canonical current, at least in 
the presently considered case of pure gravity with no matter fields. Indeed, in 
the case of the usual gravity, the requirement of coupling of h^ v to its own stress- 
energy comes about because one knows how gravity couples to the other matter. 
The consistency of the field equations (at next to linear order) then require the 
gravitational stress-energy coupling as well. No similar argument can be made 
in our present context of pure gravity. The case of gravity plus matter will be 
commented on below when we discuss gravity plus Yang-Mills unification in our 
framework. Thus, the line of reasoning based on Feynman et al. (1995), Deser 
(1970) is not applicable directly, because a different field is used to describe the 
spin 2 particles (o£ instead of h^). At the same time, an analogous argument 
would require some notion of the universal current for gravitons, and it is not 
clear what physical argument would guarantee its existence. 

A more directly applicable line of thought (Wald 1986) makes no assumption 
that gravitons couple to their own stress-tensor. Instead, the idea is simply to 
analyse the possible nonlinear completions of some linear field equations. Then, 
if some gauge symmetry was present in the linear theory (and thus there was 
an associated Bianchi identity), then there should be a nonlinear version of the 
Bianchi identity (consistency condition) . This consistency condition is then very 
restrictive and limits the types of possible gauge symmetries. In our case, this 
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type of analysis can be applied directly, taking into account the fact that the spin 
2 particle is now described by instead of /i M „. One can then see quite easily 
that there are two types of Bianchi identities satisfied by the held equations (3.5). 
One of these is a differential identity 

d A d A (J^Fi)=0. (3.32) 

Using the fact that the commutator of two covariant derivatives is the curvature, 
the above identity can be written as: 

0 = e m F k A -^-F ] ~ e tkl X k] -^-, (3.33) 

where we have used the definition of the matrix X v as the wedge product of two 
curvatures. The identity is then a direct consequence of the gauge-invariance of 
the function /. Note that (3.32) is in fact three equations, and is a differential 
identity for the field equations. It is clear that this identity has its origin in the 
gauge-invariance of the action. 

Another type of identity does not involve any derivatives of the field equations. 
It reads: 

i^F l Ad A ^F'j=0, (3.34) 

and, as we shall see, holds for any vector field £. Here is the interior 
multiplication with a vector field £ (and, in (3.34), the curvature 2-form). To prove 
this identity, we use the 'usual' Bianchi identity d A F l = 0 to take the curvature 
2-form F : ' out of the brackets. The quantity t: F^ A F 3 ' , where the brackets denote 
the symmetrization, is then proportional to L^(vol)X ij , where (vol) is the volume 
form used to define X ij via F % A F : > = (vol)X ij . So, the above identity reduces to 

X ^(^) =0 ' < 3 - 35 ) 

which is a consequence of the homogeneity (of degree one) of the function /. It is 
clear that (3.34) is related to the property of the action being invariant under the 
diffeomorphisms. We also note that the above arguments are valid for any choice 
of the gauge group, provided e ijk in the proof of the first identity is replaced with 
the appropriate structure constant. 

To summarize, we see that for any choice of /, the theory (3.4) satisfies two 
Bianchi-type identities, with one of them (3.32) being a differential identity 
involving the field equations, and the other (3.34) being simply a statement 
that the field equations are not linearly independent. The gauge symmetries of 
the theory are then the usual gauge rotations as well as diffeomorphisms. No 
additional requirements is placed on the allowed interactions. In particular, no 
restrictions are placed on the type of function / that can be used in construction 
of the Lagrangian, provided this function satisfies the requirements of gauge- 
invariance and homogeneity. No unique Lagrangian (such as that of GR) can 
follow from such arguments. 

Another GR uniqueness claim (Hojman et al. 1976) is based on the canonical 
analysis. The main idea of the argument is that the phase space of any theory of 
gravity is that of pairs (spatial metric, conjugate momentum). One also knows 
that the spatial metric must appear in the result of the commutator of two 



Proc. R. Soc. A (2012) 



2160 



K. Krasnov 



Hamiltonian constraints. This, together with some assumptions, allows one to 
fix the form of the Hamiltonian constraint and thus the gravitational dynamics. 
This argument is not applicable in our case because it can be shown by a 
direct analysis (see Krasnov 2008) that the canonical variable that arises in the 
case of theories under consideration is not the spatial metric. The algebra of 
diffeomorphisms is indeed the usual expected algebra, but in our case, the spatial 
metric appearing in the commutator of two Hamiltonian constraints turns out 
to be a complicated function of the canonical variables. It thus appears that 
the form of the Hamiltonian cannot be fixed via this type of argument in the 
case when the conngurational canonical variable is not directly related to the 
(spatial) metric. 

Finally, yet another argument for GR uniqueness Grisaru et al. (1975) is based 
on the analysis of the 4-graviton scattering amplitude and showing that it is 
essentially determined (up to a numerical factor) by the kinematical constraints 
only. However, this analysis assumes parity invariance of the amplitudes (to 
reduce the number of the independent amplitudes) . This assumption is explicitly 
violated by a generic theory from our family, with only the theory with (3.28) 
being parity invariant. Thus, while it appears that there is indeed a unique parity 
invariant gravity (which is GR), there also exists a large family of interacting 
theories of massless spin 2 particles that are not invariant under parity. 

Much more can be said about the classical behaviour of the deformations of 
GR, but we cannot go into such details because of the space restrictions. We just 
mention that the spherically symmetric (BH) solution can be obtained for an 
arbitrary / (see Krasnov & Shtanov (2008) and also Krasnov & Shtanov (2009) 
for a more informal discussion). We note that the study by Krasnov & Shtanov 
(2008) uses a certain equivalent (but not obviously so) formulation of the same 
class of theories (similar to Plebanski formulation of GR). A derivation of the 
spherically symmetric solution is also possible directly in the 'pure connection' 
framework presented here, but this will be spelled out elsewhere. We note in 
passing that a general theory from our family exhibits an interesting resolution 
of the singularity inside the black hole phenomenon, which is described in detail 
in the study of Krasnov & Shtanov (2008). 



(h) Gravity-Yang-Mills unification 

Our above discussion of the coupling of gravitons to themselves (and to other 
matter) has already made it clear that this coupling is not as straightforward as 
in the usual metric-based description. In particular, there seems to be no notion 
of some canonical stress-energy tensor through which all matter would couple 
(and gravitons would self-couple). It then appears that the only consistent way 
to couple matter to our gravity theory is by enlarging the gauge group in the 
action (3.4). As we shall soon see, an SO(3) subgroup of such a larger group 
will still describe gravity, while the part that commutes with this SO (3) will 
describe the Yang-Mills sector. Constructions of this sort where first spelled out 
(in the framework of Plebanski-like formulation) by Torres-Gomez & Krasnov 
(2010) and Torres-Gomez et al. (2010). For earlier work with similar aims (in 
the context of the Hamiltonian formulation), see Chakraborty & Peldan (1994). 
Here, we describe what appears to be a much simpler derivation. 
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Thus, let us take a general theory from the family (3.4), with some gauge group 
G. As we have already discussed above, a particularly nice set of backgrounds, 
or 'vacua' for our theory is provided by (3.7). Such a background connection 
selects some SU(2) ~ SO(3) subgroup of G. Note that there are in general several 
inequivalent embeddings of SU(2) into a larger gauge group. The idea is to fix 
such an embedding and look at the spectrum of excitations that arise around this 
background. Let us first look at the linearized theory, i.e. an expansion of the 
action up to second order in the fields. It is then easy to see that the SU(2) part 
of the gauge field perturbation is still described by the linearized action (3.27), as 
no details of the derivation presented in the corresponding subsection get changed. 
So, we still get two propagating polarizations of the graviton in this sector. 

Let us now discuss the sector of the theory described by 5A a with the index a 
being in the part of the Lie algebra of G that commutes with the fixed SU(2). We 
note that this part may be empty (this depends on the embedding chosen). Let 
us first consider the terms in the second line of (3.22). The first term becomes 
(after the limit M — > 0 is taken) a total derivative and can be dropped. In the 
second term, for the background chosen, the only non-zero values of F a are in 
the SU(2) part of the Lie algebra. But by assumption, the commutator of the two 
connections in the part of the Lie algebra that commutes with SU(2) does not have 
any component in the SU(2) part. So, the second term in the second line of (3.22) 
is also zero. We are left only with the first line. Here, the matrix of the second 
derivatives of the function / must be evaluated on the background X ab ~ (<5 iJ , 0), 
where one has 5 V in the SU(2) part of the matrix and zero everywhere else. Also, 
one is only interested in d 2 f /dX m dX^ b part of the matrix of the second derivatives. 
One can easily convince oneself that this matrix can only be proportional to & %] 5 ab . 
Thus, we can write: 

d 2 f k 

-b ij 5 a \ (3.36) 



dX m dXJ b 2 

where k is some constant. Note that this holds only for the part of the Lie algebra 
that commutes with the SU(2), because for the part that does not commute, there 
are other possible invariants that can appear on the r.h.s. of (3.36). Collecting 
all the constants (and dividing the second variation by two), we get the following 
linearized Lagrangian: 



o(2) 
YM — 2 



8»s<*(2 i ' a 'd ll 6AZ)(2 ip *d p dA*), (3.37) 



where we already took the limit M — »■ 0. As in the SU(2) sector, the factors of M 
have cancelled out in this derivation, and the only effect of the limit is in replacing 
the covariant derivatives by the usual ones (and in removing the factor). 
We now rescale a® = y/icdA^ and get the usual linearized Yang-Mills action in the 
form (2.14) encountered in the beginning of this paper. 

Thus, it is very easy to see how both gravity and Yang-Mills linearized theories 
arise from the 'mother' theory (3.4), once it is expanded around the background 
(3.7). We see that the first of the expectations spelled out in §1, namely that 
a gauge-theoretic reformulation of gravity may be of help for the problem of 
unification with the other forces, seems to be at least partially fulfilled. 
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(i) Symmetry breaking 

As we described above, backgrounds (3.7) that can be used as 'vacua' for our 
theories are in one-to-one correspondence with embeddings of the Lie algebra 5/(2) 
into the full Lie algebra q. We have discussed the interpretation of the connection 
components charged under the part of the Lie algebra Q that commutes with the 
embedded sl(2). Thus, we have seen that these describe massless gauge bosons. 
It can be shown (Krasnov 2011c) that the other connection components, i.e. in 
those directions in q that do not commute with the embedded s/(2) typically 
describe massive fields of non-zero spin. 

A very interesting symmetry-breaking scenario then becomes available. One 
can change the 'vacuum' around which the theory is studied. In one vacuum, 
where the centralizer of the embedded 5/(2) is non-trivial, we have the 
corresponding massless gauge bosons. In a vacuum with trivial centralizer, all 
massless gauge bosons of the previous vacuum become massive particles. Thus, 
by choosing the background connection (and thus the embedding of s/(2)) 
appropriately, we can break the Yang-Mills gauge group as desired. 

This symmetry-breaking mechanism is described in detail in Krasnov (2011c). 
Here, we restrict ourselves to just illustrating how this mechanism works on the 
example of G = SL(3). Note that we work with complex Lie groups, so we make no 
distinction between, e.g. SU(3) and SL(3). There are two inequivalent embeddings 
of SL(2) into SL(3). One is the obvious one that embeds 2x2 matrices in, e.g. 
the upper left corner of 3 x 3 matrices in SL(3). This embedding breaks the 
original symmetry down to SL(2) x U(l), where the latter is generated by the 
matrices diag(l, 1, —2). As we have seen in the previous subsection, components 
of the connection charged under this U(l) gauge group describe massless gauge 
bosons — i.e. a Maxwell electromagnetic potential. The other fields that arise in 
this background turn out to be massive spin 3/2 particles that are electrically 
charged. There are two such fields, oppositely charged. It is unusual to have fields 
of half-integer spin described by commuting connection components, and so one 
might worry about the Hamiltonian being not bounded from below. However, 
the Lagrangian for this fields is second order in derivatives (and is of the same 
type as all U-containing Lagrangians we have encountered above). It is not hard 
to choose the reality conditions so that the Hamiltonian is explicitly positive- 
definite, and so no problems of the type seemingly predicted by the spin-statistics 
theorem arise. For more discussion on these issues, the reader is directed to 
Krasnov (2011c). 

Let us now consider the other inequivalent embedding. In this case, there is 
no non-trivial centralizer of the embedded SL(2) in SL(3), and all the gauge 
symmetry apart from the gravitational SL(2) is broken. The massive field content 
of this model is as follows. One finds a massive spin 3, as well as a massive 
spin 1 field. We can say that the massless gauge boson have absorbed one of 
the components of the massive fields of the previous background, and became 
massive. The other massive fields have rearranged themselves into a spin 3 field. 

To summarize, we have all the ingredients of a symmetry- breaking mechanism, 
where by changing the background, we can break the gauge symmetry as desired, 
with massless gauge bosons of one background becoming massive in another. 
Many more examples of this mechanism are described in Krasnov (2011c), 
including those with a realistic Standard Model symmetry-breaking pattern. We 
refer the reader to this reference for details. 
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We finish this section by stating that, in spite of the fact that our description 
of gravity is so very far from the standard one that there is a legitimate worry, 
that one will never be able to couple any realistic matter to it, at least some types 
of matter can be coupled very naturally. Thus, we have seen that the gauge fields 
can be coupled just by enlarging the gauge group. Quite interestingly, massive 
particles can also be coupled in the same way. In particular, and this is far from 
obvious, spin 0 massive particle can also arise in this framework (see Krasnov 
(2011c) for an example). It remains to be seen whether the Standard Model 
fermions can be embedded into this framework. It currently seems that if this is 
to be possible, the anti-commuting fermionic variables must be described by the 
grading-odd elements of some super Lie algebra. 



4. Quantum theory 

In this section, we finally discuss whether our gauge-theoretic formulation of 
gravity sheds any new light on the problem of quantum gravity. We start with 
a description of how the perturbative quantum theory looks like in the new 
language. 

(a) Interactions 

Like in the metric-based GR, one can expand the full nonlinear action (3.4) 
around some background, and then attempt to compute quantum corrections to 
the theory by evaluating Feynman diagrams. If one wants the asymptotic states 
to exist (one usually does), one has to work around the Minkowski space-time 
background. One complication (that can be overcome successfully) in our case 
is that the Minkowski space-time connection is A = 0, and as such it is difficult 
to expand around (imagine having to expand the Einstein-Hilbert action around 
the zero metric). So, the natural strategy appears to take the constant curvature 
background, and expand the action around it, taking the flat limit afterwards. 
We have already seen how the linearized gravity action (3.27) appears this way. 
One can continue this process, and compute the interaction vertices. It turns 
out, however, that one should be very careful about taking the limit M — >0. 
Indeed, one finds, e.g. that the cubic interaction vertex for our gravitons is of the 
schematic form: 

£W~—(daf+— (da)a 2 . (4.1) 

Here, we have considered the case of the theory with / given by (3.28) that 
corresponds to GR. Then M p is the Planck mass M? For a general /, 

the cubic interaction is schematically the same, except that one has 1/M 2 in 
front of the first term, multiplied by some dimensionless combination of the 
parameters of / (see below). We remind the reader that the parameter M is 
the length scale introduced by the background (3.14), which is related to the 
cosmological constant as in (3.16). For details of this derivation, the reader can 
consult (Krasnov 2011a) (or perform a straightforward analysis him/herself). 

There are several points to be noted about (4.1). First, there are non- 
renormalizable (by power-counting) interactions, such as the first term in (4.1). 
Second, the interactions seem to blow up in the limit M — > 0. Finally, unlike in 
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the metric-based GR, we get more than two derivatives in the interaction vertices. 
This can be easily seen to be a general feature, and the order n vertex will have 
as many as n derivatives in the form (8a)". We can write: 

£W ~(i4)^ (8a) " + -' (4 - 2) 

where the dots denote terms that have a lower number of derivatives. We also note 
that the number of derivatives in each successive term changes by two (as in (4.1)), 
so the next term in (4.2) is proportional to (9a)"~ 2 a 2 . Thus, the perturbation 
theory is seemingly very different from the standard one. 

Both of these puzzles (blowing up interactions and the higher number of 
derivatives) can be understood by noting that there is a relation between the 
connections and the metric perturbations, which is valid on-shell only, and which 
reads, schematically, 

h=—da. (4.3) 
M V ' 

There is yet another relation which is also valid on-shell, and which reads: 

a=Uh. (4.4) 

Both of these can be true at the same time only on-shell 9 2 = M 2 . These relations 
can be used, in particular, to determine the helicity spinors in the connection 
description from those in the metric description. 2 

We see that it is tricky to take the limit M — > 0. This certainly cannot be 
done at the very start of the computation, because the interaction vertices 
seem to diverge in this limit, and so do the helicity states determined from e.g. 
(4.4). Thus, what one has to do at intermediate stages of the calculation is to 
work on the mass-shell 9 2 = M 2 , and then only take the limit M 0 after the 
physical quantities (such as the graviton-scattering amplitudes) are computed. 
This procedure can be shown to lead to the well-known graviton scattering 
amplitude results. 

Let us describe in a bit more detail how e.g. the 4-graviton scattering amplitude 
can be reproduced. As in the metric-based GR, one can show that the 4-valent 
interaction vertex cannot contribute to the answer (in the helicity basis). Thus, 
one has to analyse only the contribution from diagrams involving two 3-valent 
vertices. The 3-valent interaction vertex is of the schematic form (4.1). However, 
let us make the structure of the expression in (4.1) more explicit. The spinor 
formalism is again most suited for this purpose. So, we convert all space-time 
as well as internal indices into spinor ones. Recall that the connection becomes 
an object a ABCA with three unprimed and one primed spinor index. The relation 
(4.3) then reads: 

h-ABA'B' = -j^{da)ABA'B', (4.5) 

2 The helicity spinors can of course be built independently from any metric description, but the 
two ways of determining the helicity states agree. 
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where we do not worry about constants (which are in any case convention- 
dependent). We used the following notation for the spinor contractions: 

(da)ABA'B' ■= dfA' a B')EAB, (9a) abcd '■= 9 ] e'(ao>bcd)- (4-6) 

The second of these is to be used in the cubic interaction vertex below. In both 
expressions, here we assume the connection on the r.h.s. to take values in S\ <8> <S_, 
i.e. be totally symmetric in its three unprimed indices. As we have discussed in the 
section on linearized theory, this is the way to project out the components of the 
connection that are pure (diffeomorphism) gauge. The object on the left hand- 
side in (4.5) is then AB and A'B' symmetric, and is the spinor representation of 
the tracefree part of the graviton held h^. 

Let us also give the spinor expression for the 3-valent interaction vertex that 
one gets for the case of / corresponding to GR. Using the notations (4.6), we have: 

£(3) ~ ^ p ^) ABCD ^)ABE'F^a)cu E ' F ' + ..., (4.7) 

where the dots denote terms that go to zero in the limit M — > 0. Using this 
vertex (and appropriate spinor helicity states for the external gravitons), one 
can reproduce many of the GR graviton-scattering amplitudes. In particular, the 
usual 4-graviton scattering amplitude can be obtained, as well as the general 
MHV amplitude (it can be shown that only this vertex contributes to the 
MHV amplitudes). The latter is obtained using essentially the same Berends- 
Giele-type recursion relations (Berends & Giele (1988)) that work in the case 
of MHV amplitudes of gluon scattering in Yang-Mills theory. Details of these 
computations will appear in a future publication. 

Before we compare the above vertex with that in the case of metric-based GR, 
let us note that it is analogous to the cubic vertex of Yang-Mills theory (with 
the Lie algebra index suppressed) : 

4m ~ 9YU^a) AB a AEI a B E \ (4.8) 

where {Qo)ab '■= ^e'(ao>b) E ' is essentially the self-dual part of the curvature, and 
<?ym is the Yang-Mills coupling constant. The main difference between our gravity 
vertex (4.7) and the above Yang-Mills one is that the latter is renormalizable 
while the former is not. This is related to two extra derivatives being present in 
(4.7) when compared with (4.8). The point that we would like to emphasize 
however is that the gauge-theory formulation graviton interaction vertex is 
as simple as that of Yang-Mills theory, which gives hope that computational 
complexity can be greatly reduced using this formalism. 

Let us now quickly relate the vertex (4.7) to the cubic vertex in the metric- 
based formalism. We will only show the schematic form, because even the 
translation of (4.7) to the metric variables using the on-shell relations (4.3) and 
(4.4) is rather involved. We get: 

C®~±(#h)hh, (4.9) 
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which is the usual 3-valent vertex of the metric-based GR. In short, this is the 
explanation how such a seemingly different theory as the one with interactions 
(4.7) can reproduce the usual graviton-scattering amplitudes. 

Further, it is interesting to note that in the connection description, the parity 
invariance of GR does not become manifest. The parity invariance can be shown 
to hold (at least order by order in perturbation theory) for the theory (3.28). 
But it can also be seen to be explicitly violated by any other general member 
of the family (3.4). Thus, e.g. for the 4-graviton scattering in a generic theory, 
one finds that the all minus and all minus one plus amplitudes are zero, while 
the all plus and all plus one minus amplitudes are not. In the usual case, only 

the MHV amplitude (+ H ) is non-zero for the 4-graviton scattering. So, the 

general member of the deformations of GR family can be seen to violate parity 
quite explicitly. 

In preparation for a discussion of the quantum theory, let us briefly describe 
the structure of the interactions for a general /. In the case of / corresponding 
to GR, there are certain delicate cancellations which guarantee that there are no 
Tr((da)ABCD) 3 term in (4.7). For a general function /, there is no such cancellation, 
and the structure of the 3-vertex is 

^-^- 2 (da) AB CD (da) CD EF (da) EF AB 

+ ^ 2 {da) ABCD {Qa) ABE , F ,{da)cD E ' F ' + • • • • (4.10) 

Note that the quantity M p no longer appears, as there is no notion of the Planck 
mass for a general theory from the family (3.4). Indeed, recall that there are only 
dimensionless parameters in any of our theories (3.4). Thus, the quantities ci, C2 
above are dimensionless parameters related to the derivatives of the function / 
evaluated at the identity matrix, and M, as before, is the length scale introduced 
by the background. For the / corresponding to GR, we have c\ = 0, c 2 ~ M/M p , 
but for a general theory these are arbitrary dimensionless parameters. The 
relations (4.5) and (4.4) are still valid, and so the leading term of the 3-vertex 
can be rewritten in the metric terms as: 

^-^hf+^h)hh + .... (4.11) 

The point of this exercise is to exhibit explicitly that the interaction vertices 
of a general theory from the family (3.4) contain, when rewritten in the 
metric language, terms with more than two derivatives. In general, the n-valent 
vertex will start from a term (d 2 h) n ~ (Riemann) n when rewritten in the metric 
language. This fact will be important below when we discuss how the theories 
(3.4) behave under the renormalization. 

(b) Renormalization 

So far, we have only talked about the tree-level graviton-scattering amplitudes. 
These can be expected to agree with the results obtained in the usual metric- 
based approach (for theory (3.28)). However, as we have already discussed, the 
theory (3.28) is only on-shell equivalent to that described by the Einstein-Hilbert 
Lagrangian. Off-shell, the two actions have very different properties. Thus, one 
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can expect that the quantum theory based on (3.28) is different from that arising 
in the metric approach. More specifically, one should not expect the loop-corrected 
graviton-scattering amplitudes to be the same in the two approaches. 

With this general remarks being made, let us discuss the issue of non- 
renormalizability of GR. In the usual metric formalism, it manifests itself on-shell 
starting at the two-loop order (Goroff h Sagnotti 1986). At this order of 
perturbation theory, one finds it necessary to add a new (Riemann) 3 term to the 
Einstein-Hilbert Lagrangian to cancel the arising divergence. This is expected 
to continue at higher loop orders (even though nobody has calculated the 
3-loop divergences in pure gravity explicitly). It is thus believed that an 
infinite number of higher derivative terms has to be added to the Einstein- 
Hilbert Lagrangian in the process of its renormalization. This introduces an 
infinite number of new coupling constants, all becoming equally important 
around the Planck scale, and implying loss of predictive power of the theory 
at Planckian energies. 

Let us discuss how the non-renormalizability of gravity can manifest itself 
in the gauge-theoretic description. We have seen that (some of) the coupling 
constants of our theory, once the Lagrangian is expanded around a constant 
curvature background, are of negative mass dimension, as e.g. in the first term 
in (4.1). It is thus to be expected that higher derivative counter-terms will get 
produced in the process of renormalization of the loop corrections. However, 
unlike the case with the metric-based GR, where the perturbative expansion of the 
Lagrangian only contains two-derivative vertices, the vertices in our description 
already contain arbitrarily high powers of the derivative (see (4.2)). This fact will 
be very important below. 

Before we discuss implications of the higher derivative nature of our 
perturbative expansion, we emphasize that the appearance of all powers of the 
derivative does not mean that the theory is non-local. Indeed, as we have discussed 
in the previous section, the Lagrangian of the theory is nonlinear, but leads to 
not higher than second order in derivatives field equations. So, there is no non- 
locality in the theory, but, rather just nonlinearity that produces arbitrarily high 
powers of (9a) when the Lagrangian is expanded. 

The main point is then that, as arbitrarily high powers of the derivatives are 
already present in the expanded Lagrangian, the higher derivative terms that 
get produced in the process of the renormalization may already be contained in 
the original Lagrangian. Indeed, we have already seen how the 3- vertex (4.11) of 
the general theory, when rewritten in the metric language starts from the term 
(9 2 /i) 3 , which is nothing else but (Riemann) 3 . This is exactly the counter-term 
that is known (Goroff & Sagnotti (1986)) to be needed in the usual metric-based 
quantum gravity at two loops. Thus, at least to the lowest order, the idea that the 
terms that have to be added to our Lagrangian in the process of renormalization 
are already contained in it seems to hold. It is far from being trivial that 
this continues to happen to aU orders, but this can in principle happen, as it 
contradicts nothing we know about the behaviour of non-renormalizable theories. 
The important point here is not to see that even the higher derivative terms 
appear, for we have seen that the general n-valent vertex will start from the term 
(Riemann)". What would be non-trivial is that it is precisely the structure that 
can be encoded into a general Lagrangian of the form (3.4) that gets preserved by 
the process of the renormalization. For a more explicit calculation showing that 
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the (Riemann) 3 term that is known to arise in usual gravity at two-loops does 
appear in the perturbative expansion of a general Lagrangian from the family 
(3.4), we refer the reader to Krasnov (2010). 

To summarize, thanks to the peculiar higher derivative structure of the 
perturbatively expanded Lagrangian of our theory, it may be that the class of 
theories (3.4) is closed under the renormalization. As we have already said, this 
is far from being trivial, for we know that the class (3.4) is not the most general 
class of diffeomorphism-invariant gauge theories. Indeed, as we have discussed 
in the previous section, the theories that lead to higher than second-order held 
equations are not included into (3.4). We have then emphasized that 'not higher 
than second-order held equations' is not equivalent to 'not higher than two 
derivatives in the interaction vertices', for the nonlinearities of a second-derivative 
held equations Lagrangian can (and do) manifest themselves as higher derivative 
interactions. The conjecture is then that in the process of renormalization of 
(3.4), only similar nonlinearities appear, but no terms that change the order of 
the held equations. 

This conjecture was first stated in Krasnov (2006) in a different (Plebanski- 
like) formulation of this class of these theories. It is certainly non-trivial to prove, 
and the ongoing attempt is to compute the one-loop correction with the general 
Lagrangian (3.4) using the background held method. If it is found that at least at 
one-loop, one does not produce anything that is not already contained in (3.4), 
this would give a partial support to the conjecture. In the next subsection, we 
shall discuss a possible scenario for the ultraviolet (UV) behaviour of gravity in 
case the conjecture discussed in this subsection is correct. 

(c) Conjectural RG flow 

In this subsection, we shall assume that the class of theories (3.4) is closed 
under the renormalization and discuss what this implies for the problem of finding 
the UV completion of gravity. The closedness under the renormalization can be 
expressed as a property that the beta-function governing the renormalization 
group flow of the Lagrangian (3.4) is of the same form as the Lagrangian itself: 

df(F A F) 



9 log jJL 



■■Pj(FaF). (4.12) 



Here @f(F A F) is a function with the same properties as /, i.e. a gauge-invariant, 
and homogeneous function of degree 1 of a symemtric 3x3 matrix, which can 
thus be applied to a matrix-valued 4-form. The subscript / indicates that this 
function depends on the function / itself (i.e. the beta-function is a function of 
the coupling constants encoded in /). 

The UV behaviour of the theories (3.4) is then controlled by the UV fixed 
point (s) of the RG flow (4.12). One is then in the domain of applicability of 
the asymptotic safety ideas (see, Weinberg (2009) for a recent description). The 
idea of the asymptotic safety is that there is a UV-flxed point of the relevant 
RG flow (such as, e.g. (4.12)), with only a finite number of attractive directions. 
In other words, the idea is that most of the RG trajectories going to the UV 
approach the relevant fixed point, but then miss it and escape to infinity in the 
space of theories (coupling constants). And only a finite-dimensional subset of 
the trajectories actually end at the fixed point. The fixed point in question then 
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Figure 1. The conjectural RG flow. (Online version in colour.) 



provides the UV completion of the theory, and one gets predictive power from 
the requirement that the physical theory is on one of the trajectories that leads 
to this fixed point. Then, as the dimension of the surface of such trajectories is 
finite, one needs only a finite number of measurements to fix the theory. 

If the RG flow for the class (3.4) is of the form (4.12), the ideas of asymptotic 
safety can be applied directly. The flow is then essentially just a flow in the space 
of functions of two-variables f(X) = /(Ai, A2, A3) = Ai/(1, A2/A1, A3/A1), where we 
have used the fact that a gauge-invariant function of a symmetric 3x3 matrix 
X - ? is a function of its eigenvalues Ai, A2, A3, where X^ = 0diag(Ai, A2, A3) 0 T and 
O G SO(3) is the diagonalizing orthogonal transformation. Such a flow can be 
studied very explicitly, and the question of whether there are any UV- fixed points 
with a finite number of attractive directions becomes answerable. 

We then note that the ideas of asymptotic safety have a potential of explaining 
why it is the function (3.28) that appears as the low-energy Lagrangian. Indeed, 
one could envisage that being on a trajectory that emanates from the UV-fixed 
point necessarily implies that one flows to (3.28) at low energies. This way 
one could even explain the fact that the dimensionless constant in front of the 
GR Lagrangian (3.28), namely l/(16nGA) ~ 10 120 , is so large. Indeed, it is not 
impossible that the large numbers of this sort are produced by the RG flows. 
Thus, it may be that our scenario has something to say about the very difficult 
cosmological constant problem, which is to explain why such two different scales 
seem to coexist in Nature. 

Returning to the problem of the UV completion of gravity, we note that there 
is one point in the space of theories (3.4) that should certainly be a fixed point 
of the flow (4.12). This is the topological point ft op (X) =Tr(X). Indeed, while 
this point is not strictly speaking among the class of theories that we consider 
(dynamically non-trivial theories with two propagating degrees of freedom), it 
should certainly annihilate the beta-function 



The reason for this expectation is that a topological theory without any dynamics 
can have no non-trivial running, and this should imply (4.13). It can be 
conjectured that the fixed point (4.13) is the sought UV-fixed point. If this is 
the case, then the dynamics in the UV region is controlled by this fixed point, 
with the 'safe' theories approaching this fixed closer and closer as one increases 
the energy. The arising conjectural picture of the UV behaviour of gravity is then 
depicted in figure 1. 



% OP (^AF) = 0. 



(4.13) 
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5. Conclusions 

We have come a long way: from a simple Lagrangian (2.19) (see also (2.38)) that 
describes free gravitons in Minkowski space-time, to a possible scenario of the 
UV completion of quantum gravity. We have also seen how both spin 2 particles 
(gravitons) and spin 1 particles ('gluons') are simultaneously described by the 
same theory (3.4), once it is expanded around an appropriate background. Here 
we would like to briefly reiterate the main ideas of the approach of this paper, 
and then list questions that remain open. 

One of the main ideas was to use a gauge held rather than the metric as 
the main dynamical variable for gravity (in particular GR). We have seen how 
this is possible at both the linearized level (2.19), and, more non-trivially, at 
the full nonlinear level via theories (3.4) with (3.28). We have seen that gravity 
formulated in this language gets naturally embedded into a much larger class of 
theories (3.4), which we refer to as diffeomorphism-invariant gauge theories. The 
gravitational sector of such a general theory is then described by an appropriate 
SU(2) subgroup, while the other parts of the gauge group describe matter, in 
particular, the part of the gauge group that commutes with the gravitational 
SU(2) describing the Yang-Mills held. 

Our construction of the class of theories (3.4) seems to be a very natural one, 
as it only uses the principles of diffeomorphism and gauge-invariance (which 
are both known to be of fundamental importance) as the input, as well as 
the desire to have the held equations of not higher than second order in the 
derivatives. We note that, importantly, there are no dimensionful parameters 
in our class of theories, with the dimensionful constants arising only when the 
theory is perturbatively expanded. This feature of theories (3.4) of not having 
any dimensionful couplings is, in our opinion, a very desirable one for any would 
be fundamental theory. 

Importantly, our theories are nonlinear, while still leading to just second-order 
held equations. Nonlinearity is of course unavoidable in the real world, for without 
it there would be no interactions between particles. The most interesting theories 
that play a role in the description of Nature — Yang-Mills and GR — are both 
nonlinear. However, the nonlinearity of (3.4) is much stronger. In the case of 
Yang-Mills theory, the Lagrangian is polynomial in the fields (there are only 
up to quartic interaction vertices). In GR, non-linearity is worse, because once 
expanded (around a background) the action of GR contains vertices of arbitrarily 
high valency. Still, the order of the derivative that appears in GR vertices is 
always two (or smaller if one also takes into account the vertices coming from 
the cosmological term). The nonlinearity of our theories is similar to that in 
GR in that interaction vertices of arbitrary valency appear. However, it can be 
said to be stronger in the sense that up to the n th power of the derivative can 
appear at the n-valent vertex. On-shell, there is no contradiction between the 
two approaches. Indeed, as we have discussed, on-shell, the connection variable is 
related to the derivative of the metric (see (4.4)). This explains how what looks 
like an expansion in powers of 8 a can be rewritten as an expansion in powers 
of the metric perturbation. However, as we have discussed in detail in the first 
section of this paper, there can be no off-shell relation between the gauge-theoretic 
and the metric descriptions. Therefore, off-shell, and in particular, in quantum 
mechanical computations, one has to take the peculiar form of the perturbative 
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expansion (4.2) of the gauge-theoretic gravity Lagrangian seriously. It is then that 
the most interesting prospect, which may lead to an eventual UV completion of 
these theories, arises. 

Both GR and our gauge theories (3.4) are non-renormalizable. But in GR, the 
non-renormalizability manifests itself in a very drastic way with new terms being 
added to the Lagrangian at every loop order to cancel the arising divergences. 
These terms are of higher order in the derivatives, and for this reason they could 
not have been in the original second derivative Einstein-Hilbert Lagrangian. In 
the case of theories (3.4), we can also expect UV divergences that will have to 
be renormalized. However, the principal difference between (3.4) and the GR 
Lagrangian is that (3.4) already contains arbitrarily high powers of the derivative 
when perturbatively expanded. This comes from the nonlinearity of the theory, 
and, as we discussed above, does not contradict the fact that its held equations 
are second order. It may then be that the higher derivative counter-terms that 
are needed for the renormalization of a theory from the class (3.4) are of the same 
general form (3.4). In other words, as we have conjectured, it may be that the 
class (3.4) is closed under the renormalization. 

We then appealed to the ideas of asymptotic safety and gave a possible scenario 
for the UV-hxed point in this class of theories. Thus, the sought UV-hxed 
point that controls the dynamics of the theory at ultra-high energies may be 
the topological fixed point, corresponding to a theory without any propagating 
degrees of freedom. However, whatever the UV-hxed point may be, if the idea of 
the closedness of (3.4) under the renormalization is correct, the RG flow (4.12) has 
a potential of providing a very concrete realization of the asymptotic safety ideas. 
The RG flow also has a potential to explain why the dimensionless parameter in 
the Lagrangian of GR (3.28) is so large — which is a rephrasal of the infamous 
cosmological constant problem. 

While these possibilities are intriguing, many questions remain open and much 
more work is needed to see if the ideas described here can be realized. First and 
foremost, it is necessary to study how (3.4) gets renormalized in the quantum 
theory, and see if the expectation of closedness under the renormalization holds. 
This can be done via the standard textbook methods at least at one-loop 
level. The required Feynman rules will appear soon. Alternatively, one can 
use the more sophisticated technology of the background held method and 
the heat kernel expansion. Work on both of these lines of investigation is in 
progress. Once it is seen that the class (3.4) is preserved by the renormalization, 
the RG flow will be calculable and the ideas of asymptotic safety can 
be probed. 

Another very important line of research is that of finding which types of matter 
can naturally be coupled to gravity in this formalism. As we have discussed above, 
the only natural coupling in our approach appears to be by enlarging the gauge 
group from SU(2) that describes pure gravity to a general group G. It would be 
very interesting to see what type of matter can appear this way. The bosonic case 
has already been studied by Krasnov (2011c) and, in particular, it was found that 
massless gauge bosons and massive helds of various spins can be described. An 
interesting version of the spontaneous (gauge) symmetry-breaking scenario was 
also found in this context. It would be extremely interesting to see if the usual 
Dirac and Weyl fermions can be coupled to gravity by enlarging the gravitational 
SU(2) to some super-group. 
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Another important open problem is to understand the physical implications 
of the parity-violating character of the present class of theories. The question 
is already interesting in the context of pure gravity, for it can be, e.g. shown 
that the propagation of the gravitational waves on any time-dependent (but not 
conformafly flat) homogeneous isotropic background is left-right asymmetric in 
a general theory from the class (3.4). As it is easy to imagine that departures 
from GR described by (3.4) could be present in the very early Universe, it is 
interesting to determine whether there can be any observable effects of the parity 
asymmetry. This set of questions is also currently under investigation. 
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